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Chapter 1 

Basic Ideas 



In the end, all mathematics can be boiled down to logic and set theory. Because 
of this, any careful presentation of fundamental mathematical ideas is inevitably 
couched in the language of logic and sets. This chapter defines enough of that 
language to allow the presentation of basic real analysis. Much of it will be 
familiar to you, but look at it anyway to make sure you understand the notation. 

1.1 Sets 

Set theory is a large and complicated subject in its own right. There is no time 
in this course to touch on any but the simplest parts of it. Instead, we'll just 
look at a few topics from what is often called "naive set theory." 
We begin with a few definitions. 

A set is a collection of objects called elements. Usually, sets are denoted by 
the capital letters A, B, ■ ■ ■ , Z . A set can consist of any type and number of 
elements. Even other sets can be elements of a set. The sets dealt with here 
usually have real numbers as their elements. 

If a is an element of the set A, we write a € A. If a is not an element of the 
set A, we write a ^ A. 

If all the elements of A are also elements of B, then ^ is a subset of B. In 
this case, we write A C B ot B D A. In particular, notice that whenever ^4 is a 
set, then A c A. 

Two sets A and B are equal, if they have the same elements. In this case we 
write A = B. It is easy to see that A = B iff A C B and B C A. Estabhshing 
that both of these containments are true is the most common way to show that 
two sets are equal. 

U A C B and A ^ B, then A is a proper subset of B. In cases when this is 
important, it is written A C B instead of just A C B. 
There are several ways to describe a set. 

A set can be described in words such as "P is the set of all presidents of the 
United States." This is cumbersome for complicated sets. 
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All the elements of the set could be listed in curly braces as S = {2, 0, a}. If 
the set has many elements, this is impractical, or impossible. 

More common in mathematics is set builder notation. Some examples are 

P = {p : p is a president of the United states} 

= {Washington, Adams, Jefferson, • • • , Clinton, Bush, Obama} 

and 

^ = {n : n is a prime number} = {2, 3, 5, 7, 11, • • • }. 
In general, the set builder notation defines a set in the form 

{formula for a typical element : objects to plug into the formula}. 

A more complicated example is the set of perfect squares: 

S = {n^ : n is an integer} = {0, 1, 4, 9, • • • }. 

The existence of several sets will be assumed. The simplest of these is the 
empty set, which is the set with no elements. It is denoted as 0. The natural 
numbers is the set N = {1, 2, 3, • • • } consisting of the positive integers. The set 
Z = {• • • , —2, —1, 0, 1, 2, • • • } is the set of all integers, ui = {n & Z : n > 0} = 
{0, 1, 2, • • • } is the nonnegative integers. Clearly, 0 C A, for any set A and 

0 C N C w C Z. 

Definition 1.1.1. Given any set A, the power set of A, written '?{A), is the 
set consisting of all subsets of A; i. e., 

•?{A) = {B:BcA}. 

For example, J'({a, b}) = {0, {a}, {b}, {a, b}}. Also, for any set A, it is always 
true that 0 e y(A) and A € J'(A). If a e ^, it is never true that a € but 
it is true that {a} C y{A). Make sure you understand why! 

An amusing example is J'(0) = {0}. (Don't confuse 0 with {0}! The former 
is empty and the latter has one element.) Now, consider 

a'(0) = {0} 
W)) = {0,W} 

a'(T(a'(0))) = {0,{0},{{0}},{0,{0}}} 

After continuing this n times, for some n e N, the resulting set, 

nn---m ■■■)), 

has 2"~^ elements. 
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A B A B 



Figure 1.1: These are Venn diagrams showing the four standard binary operations on 
sets. In this figure, the set which results from the operation is shaded. 

1.2 Algebra of Sets 

Let A and B be sets. There are four common binary operations used on sets.^ 
The union of A and B is the set containing aU the elements in either A or 

B: 

A\J B ^ {x : X & Ay X e B). 

The intersection of A and B is the set containing the elements contained in 
both A and B: 

AC^B = {x■. x^AAx^B}. 

The difference of A and B is the set of elements in A and not in B: 
A\B = {x : X e A Ax ^ B). 

The symmetric difference of A and B is the set of elements in one of the sets, 
but not the other: 

Al^B ^{A\JB) \ {AC^B). 

Another common set operation is complementation. The complement of a 
set A is usually thought of as the set consisting of all elements which are not in A. 
But, a little thinking will convice you this is not a meaningful definition because 
the collection of elements not in A is not a precisely understood collection. To 

^In the following, some logical notation is used. The symbol V is the logical nonexclusive 
"or." The symbol A is the logical "and." Their truth tables are as follows: 
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make sense of the complement of a set, there miist be a well-defined universal 
set U which contains all the sets in question. Then the complement of a set 
A c U is A"^ = U \ A. It is usually the case that the universal set U is evident 
from the context in which it is used. 

With these operations, an extensive algebra for the manipulation of sets can 
be developed. It's usually done hand in hand with formal logic because the two 
subjects share much in common. These topics are studied as part of Boolean 
algebra. Several examples of set algebra are given in the following theorem and 
its corollary. 

Theorem 1.2.1. Let A, B and C he sets. 

(a) A\{B[JC) = {A\B)f^{A\C) 

(b) A\{Br^C) = {A\B)yj{A\C) 

Proof, (a) This is proved as a sequence of equivalences.^ 

x& A^x^ (5UC) 
x^AAx^BAx^C 
{xeAAx^B)A{xeAAx^C) 

xe{A\B)r\{A\C) 

(b) This is also proved as a sequence of equivalences. 

xGA\{BnC) ^ xeAAx(^iBnC) 

^=> xgAa{x^B\/x^C) 

<^ {xgAAx^B)v{xgAAx^C) 

<^ xe{A\B)U{A\C) 

□ 

Theorem 1.2.1 is a version of a group of set equations which are often called 
DeMorgan's Laws. The more usual statement of DeMorgan's Laws is in Corol- 
lary 1.2.2. Corollary 1.2.2 is an obvious consequence of Theorem 1.2.1 when 
there is a universal set to make the complementation well-defined. 

Corollary 1.2.2 (DeMorgan's Laws). Let A and B be sets. 

(a) (AU By = A^'nB" 

(b) {An BY ^A^UB" 

^The logical symbol ■^=> is the same as "if, and only if." If A and B are any two 
statements, then A B is the same as saying A implies B and B implies A. It is also 

common to use iff in this way. 



x€A\{BUC) 
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1.3 Indexed Sets 

We often have occasion to work with large collections of sets. For example, we 
could have a sequence of sets Ai, A2, A3, • • • , where there is a set An associated 
with each n e N. In general, let A be a set and suppose for each A € A there is 
a set Ax. The set {A\ : A S A} is called a collection of sets indexed by A. In 
this case, A is called the indexing set for the collection. 

Example 1.3.1. For each n G N, let A„ = {fc e Z : A:^ < n}. Then 

Ai = A2 =A3 = {-1, 0, 1}, A4 = {-2, ~1, 0, 1,2}, • • • , 

A50 = {-7, -6, -5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5, 6, 7}, • • • 

is a collection of sets indexed by N. 

Two of the basic binary operations can be extended to work with indexed 
collections. In particular, using the indexed collection from the previous para- 
graph, we define 

\^ Ax = {x : X € Ax for some A e A} 

and 

Ax = {x : X € Ax for all A e A}. 

AeA 

DeMorgan's Laws can be generalized to indexed collections. 

Theorem 1.3.1. If {Bx : X G A} is an indexed collection of sets and A is a 
set, then 

A\\jBx=f]iA\Bx) 
AeA AeA 

and 

A\f]Bx=[j{A\Bx). 
AeA AeA 

Proof. The proof of this theorem is Exercise 1.3. □ 

1.4 Functions and Relations 
1.4.1 Tuples 

When listing the elements of a set, the order in which they arc listed is unimpor- 
tant; e. g., {e,l,v,i, s} = {l,i,v, e, s}. If the order in which n items are listed 
is important, the list is called an n-tuple. (Strictly speaking, an n-tuple is not 
a set.) We denote an n-tuple by enclosing the ordered list in parentheses. For 
example, if Xi,X2,xz, X4 are 4 items, the 4-tuple (xi, a;2, 2:3, 0:4) is different from 
the 4-tuple {x2,xi,xz,X4). 

Because they are used so often, 2-tuples are called ordered pairs and a 3-tuple 
is called an ordered triple. 
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Definition 1.4.1. The Cartesian product of two sets A and B is the set of all 

ordered pairs 

Ax B = {{a, b) laGAAbG B}. 
Example 1.4.1. If A = {a, b, c} and B = {1, 2}, then 

AxB = {{a, 1), (a, 2), (6, 1), (6, 2), (c, 1), (c, 2)}. 

and 

BxA= {(1, a), (1, b), (1, c), (2, a), (2, 6), (2, c)}. 

Notice that A x B ^ B x A because of the importance of order in the ordered 
pairs. 

A useful way to visualize the Cartesian product of two sets is as a table. 
The Cartesian product ^4x5 from Example 1.4.1 is listed as the entries of the 
following table. 





1 


2 


a 


(a,l) 


(a, 2) 


b 


(M) 


(^2) 


c 


(c,l) 


(c,2) 



Of course, the common Cartesian plane from your analytic geometry course 
is nothing more than a generalization of this idea of listing the elements of a 
Cartesian product as a table. 

The definition of Cartesian product can be extended to the case of more 
than two sets. If {Ai,A2,- ■ ■ , An} are sets, then 

Ai X A2 X ■ ■ ■ X An = {(ai,a2, • • • , a„) : a/j € for 1 < fc < n} 

is a set of n-tuples. This is often written as 

n 

Y[Ak = AiXA2X---xAn. 
k=l 

1.4.2 Relations 

Definition 1.4.2. If A and B arc sets, then any R C A x B is a. relation from 
A to B. If (a, b) G R, we write aRb. 
In this case, 

dom (i?) = {a : (a, b) G R} 

is the domain of R and 

ran (i?) = {b : (a, b) G R} 

is the range of R. 
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In the special case when Rc Ax A,ioT some set A, there is some additional 
terminology. 

R is symmetric, if aRb <s=^ bRa. 

R is reflexive, if ai?a whenever a € dom(A). 
R is transitive, if ai?6 A 6i?c ai?c. 

ii is an equivalence relation on A, if it is symmetric, reflexive and transitive. 

Example 1.4.2. Let R be the relation on Z x Z defined by aRb <s=^ a < b. 
Then i? is reflexive and transitive, but not symmetric. 

Example 1.4.3. Let R be the relation on Z x Z defined by aRb a < b. 

Then R is transitive, but neither reflexive nor symmetric. 

Example 1.4.4. Let R be the relation on Z x Z defined by aRb <s=^ a^ = b^. 
In this case, R is an equivalence relation. It is evident that aRb iff 6 = a or 
b = —a. 

1.4.3 Functions 

Definition 1.4.3. A relation R c A x B is a, function if 

aRbi A aRb2 => bi = 62- 

If / C A X B is a function and dom (/) = A, then we usually write f : A ^ B 
and use the usual notation /(a) = b instead of afb. 

If f : A ^ B is a. function, the usual intuitive interpretation is to regard / 
as a rule that associates each clement of A with a unique element of B. It's not 
necessarily the case that each element of B is associated with something from 
A; i.e., B may not be ran (/). 

Example 1.4.5. Define / : N Z by /(n) = and : Z ^ Z by g{n) = n^. In 

this case ran (/) = {r? : n G N} and ran (g) = ran (/) U {0}. Notice that even 
though / and g use the same formula, they are actually different functions. 

Definition 1.4.4. \i ] : A^ B and g : B ^ C, then the com,position of g with 
/ is the function g o f : A ^ C defined by g o /(a) = g(f(a}). 

In Example 1.4.5, g o /(n) = g{f{n)) = g{n^) = {n^)^ = makes sense for 
all n € N, but f o g is undefined at n = 0. 

There are several important types of functions. 

Definition 1.4.5. A function f : A ^ B is a constant function, if ran(/) has 

a single element; i. c., there is a & G i? such that /(a) = b for all a € A. The 
function / is surjective (or onto B), if ran (/) = B. 

In a sense, constant and surjective functions are the opposite extremes. A 
constant function has the smallest possible range and a surjective function has 
the largest possible range. Of course, a function f : A ^ B can be both constant 
and surjective, if B has only one element. 
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Figure 1.2: These diagrams show two functions, f : A B and g : A ^ B. The 
function g is injective and / is not because /(a) = /(c). 

Definition 1.4.6. A function f : B \s injective (or one-to-one), if /(a) = 
f{b) implies a = b. 

The terminology "one-to-one" is very descriptive because such a function 
uniquely pairs up the elements of its domain and range. An illustration of this 
definition is in Figure 1.2. In Example 1.4.5, / is injective while g is not. 

Definition 1.4.7. A function f : A ^ B is bijective, if it is both surjective and 
injective. 

A bijective function can be visualized as uniquely pairing up all the elements 

of A and B. Some authors, favoring less pretentious language, use the more 
descriptive terminology one-to-one correspondence instead of bijection. This 
pairing up of the elements from each set is like counting them and finding they 
have the same number of elements. Given any two sets, no matter how many 
elements they have, the intuitive idea is they have the same number of elements 
if, and only if, there is a bijection between them. 

The following theorem shows that this property of counting the number of 
elements works in a familiar way. (Its proof is left as an easy exercise.) 

Theorem 1.4.1. If f : A ^ B and g : B ^ C are bisections, then gof : A^ C 
is a bijection. 
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/ 




Figure 1.3: This is one way to visualize a general invertible function. First / does 



1.4.4 Inverse Functions 

Definition 1.4.8. If f : A ^ B, C C A and D C B, then the image of 
C is the set /(C) = {/(a) : a £ C}. The inverse image of D is the set 



Definitions 1.4.7 and 1.4.8 work together in the following way. Suppose 
f : A ^ B is bijective and b G B. The fact that / is surjective guarantees 
that f~^{{b}) 0. Since / is injective, f~^{{b}) contains only one element, say 
a, where /(a) = b. In this way, it is seen that is a rule that assigns each 
element of B to exactly one element of A; i. e., is a function with domain 
B and range A. 

Definition 1.4.9. If / : A _B is bijective, the inverse of / is the function : 
B A with the property that o /(a) = a for all a S A and / o f~^{b) = b 
for all beB. 

There is some ambiguity in the meaning of between 1.4.8 and 1.4.9. The 

former is an operation working with siibsets of A and B: the latter is a function 
working with elements of A and B. It's usually clear from the context which 
meaning is being used. 

Example 1.4.6. Let A = N and B be the even natural numbers. If f : A ^ B 

is f{n) = 2n and g : B A is gin) = n/2, it is clear / is bijective. Since 
fog{n) = /(n/2) = 2n/2 = n and (/o/(n) = g{2n) = 2n/2 = n, we see g = f~^. 
(Of course, it is also true that / = g~^.) 

Example 1.4.7. Let / : N Z be defined by 



something to a and then / ^ undoes it. 



/ 



\D)^{a:f{a)eD}. 




(n-l)/2, nodd. 



n even 



It's quite easy to see that / is bijective and 
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Given any set A, it's obvious there is a bijection f : A ^ A and, ii g : A ^ B 
is a bijection, then so is : B ^ A. Combining these observations with 
Theorem 1.4.1, an easy theorem fohows. 

Theorem 1.4.2. Let § be a collection of sets. The relation on S defined by 

A ^ B there is a bijection f : A ^ B 

is an equivalence relation. 

1.4.5 Schroder-Bernstein Theorem 

The following theorem is a powerful tool in set theory, and shows that a seem- 
ingly intuitively obvious statement is sometimes difficult to verify. It will be 
used in Section 1.5. 

Theorem 1.4.3 (Schroder-Bernstein"^). Let A and B be sets. If there are in- 
jective functions f : A ^ B and g : B ^ A, then there is a bijective function 
h:A^B. 



A 




B 



Figure 1.4: Here are the first few steps from the construction used in the proof of 
Theorem 1.4.3. 



Proof. Let Bi= B\ f{A). U Bk C B is defined for some fc e N, let Ak = g{Bk) 
and Bk+i — f{Ak). This inductively defines A^. and B^ for all k G N. Use these 
sets to define A = [Jj^^jq Ak and h : A B a.s 



g ^(x), X e A 
fix), xeA\A 



h{x) 

It must be shown that h is well-defined, injective and surjective. 



•^This is often called the Cantor-Schroder-Bernstein or Cantor-Bernstein Theorem, despite 
the fact that it was apparently first proved by Richard Dedekind. 



June 2, 2014 



http;//math.louisville.edu/'^lee/ira 



1.5. CARDINALITY 



1-11 



To show h is wcU-dcfincd, let x G A. If a; € A\A, then it is clear h{x) = f{x) 
is defined. On the other hand, it x £ A, then x G A^ for some k. Since 
X G Ak = g{Bk), we see h{x) = g~^{x) is defined. Therefore, h is well-defined. 

To show h is injective, let x,y € A with x ^ y. 

If both x,y G A ov x,y S A \ A, then the assumptions that g and / are 
injective, respectively, imply h{x) ^ h{y). 

The remaining case is when x E A and y G A \ A. Suppose x G Ak and 
h{x) = If = 1, then h{x) = g'^x) G Bi and h{y) = f{y) G f{A) = 

B \ Bi. This is clearly incompatible with the assumption that h{x) = h{y). 
Now, suppose A; > 1. Then there is an xi G Bi such that 

X = gofogofo---ofo g{xi). 
A; — 1 /'s and k g's 

This implies 

h{x) = g~^{x) = fogo fo---o fo g{xi) = f{y) 

'• ' 

fc — 1 /'s and fc — 1 g's 

so that 

y = g°f°gofo---°f° g{xi) g Ak-i c A. 

V ' 

k — 2 /'s and k — 1 g's 

This contradiction shows that h{x) ^ h{y). We conclude h is injective. 

To show h is surjective, let y G B. It y G B^ for some k, then h{Ak) = 
g~^{Ak) = Bk shows y G h{A). It y ^ Bk for any k, y G f{A) because 
Bi = B \ f{A), and g{y) ^ A, so y = h{x) = f{x) for some x G A. This shows 
h is surjective. □ 

The Schroder-Bernstein theorem has many consequences, some of which are 
at first a bit unintuitive, such as the following theorem. 

Corollary 1.4.4. There is a bijective function /i : N — >■ N x N 

Proof. If / : N — >■ N X N is f{n) = {n, 1), then / is clearly injective. On the other 
hand, suppose : N x N — )• N is defined by g{{a, b)) = 2^3^. The uniqueness of 
prime factorizations guarantees g is injective. An application of Theorem 1.4.3 

yields h. □ 

To appreciate the power of the Schroder-Bernstein theorem, try to find an 
explicit bijection /i : N N x N. 

1.5 Cardinality 

There is a way to use sets and functions to formalize and generalize how we 
count. For example, suppose we want to count how many elements are in the 
set {a, b, c}. The natural way to do this is to point at each element in succession 
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and say "one, two, three." What is really happening is that we're defining 
a bijective function between {a,b,c} and the set {1,2,3}. This idea can be 
generalized. 

Definition 1.5.1. Given n G N, the set n = {1,2, •• • ,n} is called an initial 
segment of N. The trivial initial segment is 0 = 0. A set S has cardinality n, 
if there is a bijective function f : S ^n. In this case, we write card (S) = n. 

The cardinalities defined in Definition 1.5.1 are called the finite cardinal 
numbers. They correspond to the everyday counting numbers we usually use. 
The idea can be generalized still further. 

Definition 1.5.2. Let A and B be two sets. If there is an injective function 
f : A ^ B, we say card (A) < card (B). 

According to Theorem 1.4.3, the Schroder-Bernstein Theorem, if card (A) < 

card (B) and card (B) < card (A), then there is a bijective function f : A ^ B. 
As expected, in this case we write card {A) = card {B) . When card {A) < 
card(B), but no such bijection exists, we write card(^) < card(B). Theorem 
1.4.2 shows that card {A) = card [B) is an equivalence relation between sets. 

The idea here, of course, is that card {A) = card {B) means A and B have 
the same number of elements and card {A) < card (B) means ^4 is a smaller set 
than B. This simple intuitive understanding has some surprising consequences 
when the sets involved do not have finite cardinality. 

In particular, the set A is countably infinite, if card(^) = card(N). In this 
case, it is common to write card (N) = Hq.'* When card (A) < Hq, then A is said 
to be a countable set. In other words, the countable sets are those having finite 
or countably infinite cardinality. 
Example 1.5.1. Let / : N ^ Z be defined as 



It's easy to show / is a bijection, so card (N) = card (Z) = Kq. 
Theorem 1.5.1. Suppose A and B are countable sets. 

(a) Ax B is countable. 

(b) Au B is countable. 

Proof, (a) This is a consequence of Theorem 1.4.4. 

(b) This is Exercise 1.19. □ 

An alert reader will have noticed from previous examples that 

Ho = card (Z) = card (uj) = card (N) = card (N x N) = card (N x N x N) = • • • 

A logical question is whether all sets either have finite cardinality, or are count- 
ably infinite. That this is not so is seen by letting 5 = N in the following 



*The symbol K is the Hebrew letter "aleph" and Kq is usually pronounced "aleph nought." 




when n is odd 



when n is even 



theorem. 
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Theorem 1.5.2. If S is a set, caxd{S) < card(T(5)). 

Proof. Since {a} G ^(5) for all a G 5, it follows that card (5) < card(?(S')). 
Therefore, it suffices to prove there is no surjective function / : S* — > ^'(S'). 

To see this, assume there is such a function / and letT = {x G S : x ^ f{x)}- 
Since / is surjective, there is a t G T such that f{t) = T. Either t <eT or t ^T. 

If f G T = /(T), then the definition of T implies t ^ T, a, contradiction. On 
the other hand, if i ^ T = /(T), then the definition of T implies t e T. These 
contradictions lead to the conclusion that no such function / can exist. □ 

A set S is said to be uncountably infinite, or just uncountable, if Ho < card (S). 
Theorem 1.5.2 implies Hq < card(IP(N)), so 5'(N) is uncountable. In fact, the 
same argument implies 

Ho = card (N) < card (T(N)) < card (^(^(N))) < • • • 

So, there are an infinite number of distinct infinite cardinalities. 

In 1874 Georg Cantor [5] proved card (M) = card (^(N)) and card (M) > Hq, 
where M is the set of real numbers. (A version of Cantor's theorem appears in 
Theorem 2.4.3 below.) This naturally led to the question whether there are sets 
S such that Hq < card (S) < card (M). Cantor spent many years trying answer 
this question and never succeeded. His assumption that no such sets exist came 
to be called the continuum hypothesis. 

The importance of the continuum hypothesis was highlighted by David 
Hilbert at the 1900 International Congress of Mathematicians in Paris, when 
he put it first on his famous list of the 23 most important open problems in 
mathematics. Kurt Godel proved in 1940 that the continuum hypothesis can- 
not be disproved using standard set theory, but he did not prove it was true. 
In 1963 it was proved by Paul Cohen that the continuum hypothesis is actually 
unprovable as a theorem in standard set theory. 

So, the continuum hypothesis is a statement with the strange property that 
it is neither true nor false within the framework of ordinary set theory. This 
means that in the standard axiomatic development of set theory, the contin- 
uum hypothesis, or a suitable negation of it, can be taken as an additional 
axiom without causing any contradictions. The technical terminology is that 
the continuum hypothesis is independent of the axioms of set theory. 

The proofs of these theorems are extremely difficult and entire broad areas of 
mathematics were invented just to make their proofs possible. Even today, there 
are some deep philosophical questions swirling around them. A more technical 
introduction to many of these ideas is contained in the book by Ciesiclski [7]. 
A nontechnical and very readable history of the efforts by mathematicians to 
understand the continuum hypothesis is the book by Aczel [1]. 

1.6 Exercises 

1.1. If a set S has n elements for n G w, then how many elements are in J'(S')? 
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1.2. Prove that for any sets A and B, 

(a) A = {AnB)U{A\B) 

(b) AUB = {A\B)[J{B\A)[J{AnB) and that the sets A\B, B\ A and 
An B are pairwise disjoint. 

(c) A\B = AnB''. 

1.3. Prove Theorem 1.3.1. 

1.4. For any sets A, B, C and D, 

(A X S) n (C X £)) = (A n C) X (B n D) 

and 

(A X B) U (C X £)) C (A U C) X (B U D). 
Why does equality not hold in the second expression? 

1.5. Prove Theorem 1.4.2. 

1.6. Suppose R is an equivalence relation on A. For each x G A define 

Cx = {y & A : xRy}. Prove that if x,y € A, then cither Cx = Cy or Cx<~^Cy = 0. 
(The collection {Cx ■ x € A} is the set of equivalence classes induced by R.) 

1.7. If / : ^ — ^ i? and g : B ^ C are bijections, then so is g o f : A ^ C. 

1.8. Prove or give a counter example: f : X Y is injective iff whenever A 
and B are disjoint subsets of Y, then f~^{A) n f~^{B) = 0. 

1.9. If / : ^ — ^ B is bijective, then is unique. 

1.10. Prove that f : X ^Y is surjective iff for each subset Ad X, Y\f{A) c 
/(^\^)- 

1.11. Suppose that Ak is a set for each positive integer k. 

(a) Show that x e H^i (Ufeln ^k) i^ x & A^ for infinitely many sets A^. 

(b) Show that x G U^i (DfeLn ^k) iff a; e for all but finitely many of the 
sets Ak- 

The set (UfeLn ^fc) from (a) is often called the superior limit of the sets 

Ak and [J^^i (Plfcln ^fe) often called the inferior limit of the sets Ak- 

1.12. Given two sets A and _£?. it is common to let A^ denote the set of all 
functions f : B ^ A. Prove that for any set A, card (^2^^^ = card(J'(^)). This 
is why many authors use 2^ as their notation for 'J'{A). 
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1.13. Let 5 be a set. Prove the following two statements are equivalent: 

(a) S is infinite; and, 

(b) there is a proper subset T of S and a bijection / : 5 — >■ T. 
This statement is often used as the definition of when a set is infinite. 

1.14. If S is an infinite set, then there is a countably infinite collection of 
nonempty pairwise disjoint infinite sets T„, n e N such that S = UnGN^"- 

1.15. Without using the Schroder-Bernstein theorem, find a bijection / : 
[0,1] ^(0,1). 

1.16. Find a function / : K \ {0} -S- K \ {0} such that /"^ = 1//. 

1.17. Find a bijection / : [0, oo) (0, oo). 

1.18. If / : A — )• iJ and g : B ^ A are functions such that / o g[x) = x for all 
X G B and g o f(x) = x for all x G A, then = g. 

1.19. If A and B are sets such that card (A) = card(i3) = Hq, then 
card (A US) = Kq. 

1.20. Using the notation from the proof of the Schroder-Bernstein Theorem, 
let A = N, S = Z, f{n) = n and 



Calculate h{6) and h{7). 

1.21. Suppose that in the statement of the Schroder-Bernstein theorem A = 
B = Z and f{n) = g{n) = 2n. Following the procedure in the proof yields what 
function /i? 

1.22. If {An : n e N} is a collection of countable sets, then UneN^" 
countable. 

1.23. If A and B arc sets, the set of all functions f : A ^ B is often denoted 
by B^. If S' is a set, prove that card (S^) = card 



1.24. If Ho < card (S)), then there is an injective function f : S ^ S that is 

not surjective. 

1.25. If card(S') = Hg, then there is a sequence of pairwise disjoint sets 
T„, n e N such that card (T„) = Hq for every n e N and UneN = 




1 - 3n, n < 0 
3n - 1, n > 0 
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Chapter 2 

The Real Numbers 



This chapter concerns what can be thought of as the rules of the game: the 
axioms of the real numbers. These axioms imply all the properties of the real 
numbers and, in a sense, any set satisfying them is uniquely determined to be 
the real numbers. 

The axioms are presented here as rules without very much justification. 
Other approaches can be used. For example, a common approach is to be- 
gin with the Peano axioms — the axioms of the natural numbers — and build 
up to the real numbers through several "completions" of the natural numbers. 
It's also possible to begin with the axioms of set theory to build up the Peano 
axioms as theorems and then use those to prove our axioms as further theorems. 
No matter how it's done, there are always some axioms at the base of the struc- 
ture and the rules for the real numbers are the same, whether they're axioms 
or theorems. 

We choose to start at the top because the other approaches quickly turn into 
a long and tedious labyrinth of technical exercises without much connection to 
analysis. 

2.1 The Field Axioms 

These first six axioms are called the field axioms because any object satisfying 
them is called a field. They give the algebraic properties of the real numbers. 

A field is a nonempty set F along with two binary operations, multiplication 
X : F X F — ^ F and addition -|- : F x F — ^ F satisfying the following axioms.^ 

Axiom 1 (Associative Laws). If a,b,c G F, then {a + b) + c = a + {b + c) and 

{a X b) X c ^ a X (b X c). 

^ Given three sets A, B and C, a function f : A X B ^ C is called a binary operation. In 
other words, a binary operation is just a function with two arguments. The standard notations 
o{ +{a,b) = a + b and x (a, 6) = a x 6 are used here. The symbol x is unfortunately used for 
both the Cartesian product and the field operation, but the context in which it's used removes 
the ambiguity. 
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Axiom 2 (Commutative Laws). If a, 6 e F, tlien a + b = b + a and ax b = bx a. 

Axiom 3 (Distributive Law). If a,b,cG F, then ax {b + c) = {a x b) + {a x c). 

Axiom 4 (Existence of identities). Tiiere are 0, 1 S F such that a + 0 = a and 
a X 1 = a, for all a e F. 

Axiom 5 (Existence of an additive inverse). For each a S F there is — a G F 
such that a + (—a) = 0. 

Axiom 6 (Existence of a multiplicative inverse) . For each a G F \ {0} there is 
a~^ e F such that a x a~^ = 1. 

Although these axioms seem to contain most properties of the real numbers 
we normally use, they don't characterize the real numbers; they just give the 
rules for arithmetic. There are other fields besides the real numbers and studying 
them is a large part of most abstract algebra courses. 

Example 2.1.1. Prom elementary algebra we know that the rational numbers 

Q = {p/q : p e z A 9 e N} 

form a field. It is shown in Theorem 2.3.1 that \/2 ^ Q, so Q doesn't contain 

all the real numbers. 

Example 2.1.2. Let F = {0,1,2} with addition and multiplication calculated 
modulo 3. It is easy to check that the field axioms are satisfied. This field is 
usually called Z3. 

The following theorems, containing just a few useful properties of fields, are 
presented mostly as examples showing how the axioms are used. More complete 
developments can be found in any beginning abstract algebra text. 

Theorem 2.1.1. The additive and multiplicative identities of a field F are 
unique. 

Proof. Suppose ei and 62 are both multiplicative identities in F. Then 

ei = ei X 62 = 62, 

so the multiplicative identity is unique. The proof for the additive identity is 

essentially the same. □ 

Theorem 2.1.2. Let ¥ be a field. If a,b g¥ with b 0, then —a and b~^ are 
unique. 

Proof. Suppose bi and 62 are both multiplicative inverses for b^O. Then, using 
Axioms 4 and 1, 

61 = 61 X 1 = 61 X (6 X 62) = (&i X ^') X &2 = 1 X &2 = &2- 

This shows the multiplicative inverse in unique. The proof is essentially the 
same for the additive inverse. □ 
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There are many other properties of fields which could be proved here, but 
they correspond to the usual properties of the real numbers learned in beginning 
algebra, so we omit them. Some of them are in the exercises at the end of this 
chapter. 

From now on, the standard notations for algebra will usually be used; e. g., 
we will allow ab instead of a x 6 and a/b instead of a x b^^. The reader may 
also use the standard facts she learned from elementary algebra. 

2.2 The Order Axiom 

The axiom of this section gives the order and metric properties of the real 
numbers. In a sense, the following axiom adds some geometry to a field. 

Axiom 7 (Order axiom.). There is a set P C F such that 

(i) If a, & e P, then a + b,ab € P. 

(ii) If a e F, then exactly one of the following is true: a £ P, —a G P ov a = 0. 

Any field F satisfying the axioms so far listed is naturally called an ordered 
field. Of course, the set P is known as the set of positive elements of F. Using 
Axiom 7(ii), we see that F is divided into three pairwise disjoint sets: P, {0} 

and {—X : X E P}. The latter of these is, of course, the set of negative elements 
from F. The following definition introduces familiar notation for order. 

Definition 2.2.1. We write a < 6 or 6 > a, if 6 — a € P. The meanings of 
a<b and b>a are now as expected. 

Notice that a > 0 <^=^ a = a - 0 e P and a < 0 <^=^ -o = 0 - a e P, so 
a > 0 and a < 0 agree with our usual notions of positive and negative. 

Our goal is to capture all the properties of the real numbers with the ax- 
ioms. The order axiom eliminates many fields from consideration. For example. 
Exercise 2.5 shows the field Z3 of Example 2.1.2 is not an ordered field. On the 
other hand, facts from elementary algebra imply Q is an ordered field, so the 
first seven axioms still don't "capture" the real numbers. 

Following are a few standard properties of ordered fields. 

Theorem 2.2.1. Let F he an ordered field and a e F. a ^ 0 iff a'^ > 0. 

Proof. (^) If a > 0, then > 0 by Axiom 7(i). If a < 0, then -o > 0 by 
Axiom 7(ii) and = la^ = (-1)(-I)a2 = (_a)2 > 0. 

(<;=:) Since 0^ = 0, this is obvious. □ 

Theorem 2.2.2. // F is an ordered field and a,b,c G ¥, then 

(a) a < b a + c < b + a, 

(b) a < b Ab < c =4- a < c, 

(c) a < b A 0 0 ac < be, 
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(d) a < 6 A c < 0 ac > be. 

Proof, (a) a <b b-a e P (6 + c) - (a + c) e P a + c < b + c. 

(b) By supposition, both b — a,c — b £ P. Using the fact that P is closed 
under addition, we see {b — a) + {c — b) = c — a G P. Therefore, c> a. 

(c) Since b — aEP and c E P and P is closed under multiplication, c{b—a) = 
cb — ca dz P and, therefore, ac < be. 

(d) By assumption, b — a, —e € P. Apply part (c) and Problem 1. □ 

Theorem 2.2.3 (Two Out of Three Rule). Let ¥ be an ordered field and a,b,c€ 
¥. If ab = c and any two of a, b or c are positive, then so is the third. 

Proof. If o > 0 and b> 0, then Axiom 7(a) implies c > 0. Next, suppose a > 0 
and c > 0. In order to force a contradiction, suppose 6 < 0. In this case. Axiom 
7(ii) shows 

0 < a(-6) = -{ab) = -c < 0, 
which is impossible. □ 

Corollary 2.2.4. Let F be an ordered field and a € F. If a > 0, then a^^ > 0. 
Ifa<0, then < 0. 

Proof. The proof is Exercise 2.2. □ 

Suppose a > 0. Since la = a, Theorem 2.2.3 implies 1 > 0. Applying 
Theorem 2.2.2, we see that 1 + 1 > 1 > 0. It's clear that by induction, we can 
find a copy of N embedded in any ordered field. Similarly, Z and Q also have 
unique copies in any ordered field. 

The standard notation for intervals will be used on an ordered field, F; i. c., 
(a, b) = {x e¥ : a < X < b}, (a, oo) = {x G ¥ : a < x}, [a,b] = {x e¥ : a < X < 
b}, etc. It's also useful to start visualizing F as a standard number line. 

2.2.1 Metric Properties 

The order axiom on a field F allows us to introduce the idea of the distance 
between points in F. To do this, we begin with the following familiar definition. 

Definition 2.2.2. Let F be an ordered field. The absolute value function on F 
is a function I • I : F F defined as 




X, X >0 
—X, X <Q 



The most important properties of the absolute value function are contained 

in the following theorem. 

Theorem 2.2.5. Let ¥ be an ordered field and x,y & ¥. Then 
(a) |x| > 0 and \x\ = 0 <s=^ x = 0; 
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(b) = I — x\; 

(c) — |a;| < a; < |a;|; 

(d) \x\ < y —y<x< y; and, 

(e) |a; + y| < \x\ + \y\. 

Proof. (a) The fact that \x\ > 0 for all a; € F follows from Axiom 7(b). Since 

0 = —0, the second part is clear. 

(b) If a; > 0, then —x < 0 so that | — a;| = —{—x) = x = \x\. If a; < 0, then 
—X > 0 and \x\ = —x = | — a;|. 

(c) If a; > 0, then —\x\ = —x <x = \x\. If a; < 0, then —\x\ = —{—x) = x < 
—X = \x\. 

(d) This is left as Exercise 2.3. 

(e) Add the two sets of inequalities — |a;| < x < \x\ and —\y\ < y < \y\ to see 

+ \y\) < a; + y < |a:| + Now apply (d). 



From studying analytic geometry and calculus, we are used to thinking of 
|.T — j/| as the distance between the numbers x and y. This notion of a distance 
between two points of a set can be generalized. 

Definition 2.2.3. Let S* be a set and d : S x S ^ R satisfy 

(a) for all x,y & S, d{x, y) > 0 and d{x, y) = 0 <^=^ x = y, 

(b) for all x,y € S, d{x, y) — d{y, a;), and 

(c) for all x,y,z € S, d(x, z) < d(x, y) + d{y, z). 

Then the function d is a metric on S. The pair {S, d) is called a m,etric space. 

A metric is a function which defines the distance between any two points of 
a set. 

Example 2.2.1. Let 5 be a set and define d : S x S ^ S hy 



It can readily be verified that d is a metric on S. This simplest of all metrics is 
called the discrete metric and it can be defined on any set. It's not often useful. 

Theorem 2.2.6. If¥ is an ordered field, then d{x,y) = \x — y\ is a metric on 



The metric on F derived from the absolute value function is called the stan- 
dard metric on F. There are other metrics sometimes defined for specialized 
purposes, but we won't have need of them. 



□ 




F. 



Proof. Use parts (a), (b) and (e) of Theorem 2.2.5. 



□ 
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2.3 The Completeness Axiom 

All the axioms given so far are obvious from beginning algebra, and, on the 
surface, it's not obvious they haven't captured all the properties of the real 
numbers. Since Q satisfies them all, the following theorem shows that we're not 
yet done. 

Theorem 2.3.1. There is no a G Q such that c? = 2. 

Proof. Assume to the contrary that there is a e Q with = 2. Then there are 
p, g' e N such that a= p/q with p and q relatively prime. Now, 

=2 ^ p' = 2q^ (2.1) 

shows p^ is even. Since the square of an odd number is odd. p must be even; i. e., 
p = 2r for some r e N. Substituting this into (2.1), shows 2r^ = g^. The same 
argument as above establishes q is also even. This contradicts the assumption 
that p and q are relatively prime. Therefore, no such a exists. □ 

Since we suspect \/2 is a perfectly fine number, there's still something missing 
from the list of axioms. Completeness is the missing idea. 

The Completeness Axiom is somewhat more complicated than the previous 
axioms, and several definitions are needed in order to state it. 

2.3.1 Bounded Sets 

Definition 2.3.1. A subset S of an ordered field F is bounded above, if there 
exists M e F such that M > a; for all x G S. A subset S of an ordered field F is 
bounded below, if there exists m G F such that m < x for all x G S. The elements 
M and m are called an upper bound and lower bound for S, respectively. If S is 
bounded both above and below, it is a bounded set. 

There is no requirement in the definition that the upper and lower bounds 
for a set are elements of the set. They can be elements of the set, but typically 
are not. For example, if = (— oo, 0), then [0, oo) is the set of all upper bounds 
for TV, but none of them is in N . On the other hand, if T = (— oo, 0], then [0, oo) 
is again the set of all upper bounds for T, but in this case 0 is an upper bound 
which is also an element of T. An extreme case is 0. A vacuous argument shows 
every element of F is both an upper and lower bound, hut obviously none of 
them is in 0. 

A set need not have upper or lower bounds. For example N — (— oo, 0) has 
no lower bounds, while P — (0, oo) has no upper bounds. The integers, Z. has 
neither upper nor lower bounds. If S has no upper bound, it is unbounded, above 
and, if it has no lower bound, then it is unbounded below. In either case, it is 
usually just said to be unbounded. 

If M is an upper bound for the set S, then every x > M is also an upper 
bound for S. Considering some simple examples should lead you to suspect that 
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among the upper bounds for a set, there is one that is best in the sense that 
everything greater is an upper bound and everything less is not an upper bound. 
This is the basic idea of completeness. 

Definition 2.3.2. Suppose F is an ordered field and S is bounded above in F. 
A number B g¥ is called a least upper bound of S if 

(a) B is an upper bound for S, and 

(b) if a is any upper bound for S, then B < a. 

If S is bounded below in F, then a number 6 € F is called a greatest lower bound 

of S* if 

(a) 6 is a lower bound for S, and 

(b) if a is any lower bound for S, then b > a. 

Theorem 2.3.2. If¥ is an ordered field and A c¥ is nonempty, then A has 
at most one least upper bound and at most one greatest lower bound. 

Proof. Suppose ui and U2 are both least upper bounds for A. Since ui and U2 
are both upper bounds for A, Ui < U2 < Ui Ui = U2. The proof of the 

other case is similar. □ 

Definition 2.3.3. If A C F is nonempty and bounded above, then the least 
upper bound of A is written lub^. When A is not bounded above, we write 
lub A = oo. When A = 0, then lub A = -oo. 

If ^ C F is nonempty and bounded below, then the greatest lower of A is 
written gib A. When A is not bounded below, we write gib A = — oo. When 
A = 9, then gib A = oo.^ 

Notice that the symbol "oo" is not an element of F. Writing lub ^4 = oo is 
just a convenient way to say A has no upper bounds. Similarly lub0 = — oo 
tells us 0 has every real number as an upper bound. 

Theorem 2.3.3. Let A C ¥ and a e ¥ . a = lub A iff (a, oo) n A = 0 and for 
all e > 0, (a — £, a] n A ^ 0. Similarly, a — gib A iff (— oo, a) fl A = 0 and for 
alls>0, [a, a + £) n A 7^ 0. 

Proof. We will prove the first statement, concerning the least upper bound. The 
second statement, concerning the greatest lower bound, follows similarly. 

(^) If a; G (a, oo) D A, then a cannot be an upper bound of A, which is a 
contradiction. If there is an £ > 0 such that (a — £, a] n A = 0, then from above, 
we conclude (a — £, oo) fl A = 0. This implies a — e/2 is an upper boimd for A 
which is less than a = lub A. This contradiction shows {a — e,a] Ci A ^ 9. 

(<;=) The assumption that (a, oo) n A = 0 implies a > lub A. On the other 
hand, suppose lub A < a. By assumption, there is an a; G (lub A, a) (1 A. This 
is clearly a contradiction, since lub A < x G A. Therefore, a = lub A. □ 

^Some people prefer the notation sup A and inf A instead of lub A and gib A, respectively. 
They stand for the supremum and infimum of A. 
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An caglc-cycd reader may wonder why the intervals in Theorem 2.3.3 are 
{a — e, a] and [a, a + e) instead of {a — s, a) and (a, a + s). Just consider the 
case A = {a} to see that the theorem fails when the intervals are open. When 
\uhA ^ A OT gib A ^ A, the intervals can be open, as shown in the following 
corollary. 

Corollary 2.3.4. If A is bounded above and a = lubA ^ A, then for all 
£ > 0, {a — e, a) (1 A is an infinite set. Similarly, if A is bounded below and 
j3 = gib A ^ A, then for all £ > 0, {13, 13 + e) (1 A is an infinite set. 

Proof. Let £ > 0. According to Theorem 2.3.3, there is an a; i G {a — e,a] O A. 
By assumption, Xi < a. We continue by induction. Suppose n G N and Xn has 
been chosen to satisfy a;„ e (a — e, a) fl A. Using Theorem 2.3.3 as before to 
choose Xn+i G (x„,a) fl A. The set {a;„ : n e N} is infinite and contained in 

(a-£,a)nA. □ 

When F = Q, Theorem 2.3.1 shows there is no least upper bound for A = 
{x : x'^ < 2} in Q. In a sense, Q has a hole where this least upper bound should 
be. Adding the following completeness axiom enlarges Q to fill in the holes. 

Axiom 8 (Completeness). Every nonempty set which is bounded above has a 
least upper bound. 

This is the final axiom. Any field F satisfying all eight axioms is called a 
complete ordered field. Wc assume the existence of a complete ordered field, M, 
called the real numbers. 

In naive set theory it can be shown that if Fi and F2 are both complete 
ordered fields, then they are the same, in the following sense. There exists a 
unique bijective function i : Fi — > F2 such that i(a + b) — i{a) + i{b), i{ab) 

■ and a < h i{a) < i{b). Such a function i is called an order 

isomorphism. The existence of such an order isomorphism shows that M is 
essentially unique. More reading on this topic can be done in some advanced 
texts [9, 10]. 

Every statement about upper bounds has a dual statement about lower 
bounds. A proof of the following dual to Axiom 8 is left as an exercise. 

Corollary 2.3.5. Every nonempty set which is bounded below has a greatest 
lower bound. 

In Section 2.4 it will be shown that there is an a; € M satisfying = 2. This 
will show M removes the deficiency of Q highlighted by Theorem 2.3.1. The 
Completeness Axiom plugs up the holes in Q. 

2.3.2 Some Consequences of Completeness 

The property of completeness is what separates analysis from geometry and 
algebra. It requires the use of approximation, infinity and more dynamic visu- 
alizations than algebra or classical geometry. The rest of this course is largely 
concerned with applications of completeness. 
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Theorem 2.3.6 (Archimedean Principle ). If a gM., then there exists e N 
such that Ua > a. 

Proof. If the theorem is false, then a is an upper bound for N. Let (3 = lubN. 
According to Theorem 2.3.3 there is an m € N such that m > j3 — 1. But, this 



Some other variations on this theme arc in the following corollaries. 
Corollary 2.3.7. Let o, 6 e M with a > 0. 

(a) There is an n gN such that an > b. 

(b) There is an n gN such that 0 < 1/n < a. 

(c) There is an n € N such that n — 1 < a < n. 

Proof, (a) Use Theorem 2.3.6 to find n € N where n > b/a. 

(b) Let 6 = 1 in part (a) . 

(c) Theorem 2.3.6 guarantees that 5* = {n e N : n > a} ^ 0. If n is the least 
element of this set, then n — 1 ^ S and n — 1 < a < n. □ 

Corollary 2.3.8. If I is any interval from R, then / n Q ^ 0 and InQ" ^iD. 
Proof. Left as an exercise. □ 

A subset of K which intersects every interval is said to be dense in R. Corol- 
lary 2.3.8 shows both the rational and irrational numbers are dense. 

2.4 Comparisons of Q and M 

All of the above still docs not establish that Q is different from K. In Theorem 
2.3.1, it was shown that the equation = 2 has no solution in Q. The following 
theorem shows x'^ = 2 does have solutions in R. Since a copy of Q is embedded 
in R, it follows, in a sense, that R is bigger than Q. 

Theorem 2.4.1. There is a positive a e R such that o? = 2. 

Proof Let S = {x> 0 : x'^ < 2}. Then 1 e 5, so 5 7^ 0. If a; > 2, then Theorem 
2.2.2(c) implies a;^ > 4 > 2, so 5 is bounded above. Let a = luhS. It will be 
shown that = 2. 

Suppose first that < 2. This assumption implies (2 — a^)/(2a + 1) > 0. 
According to Corollary 2.3.7, there is an n e N large enough so that 



is a contradiction because /3 = lub N < m -|- 1 e N. 



□ 
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Qfi = 


.ai(l) 


ai(2 


0-2 = 


•a2(l) 


"2(2 


as = 


•a3(l) 


"3(2 


a4 = 


.a4(l) 


"4(2 


"5 = 


.a5(l) 


as (2 



ai(3) ai(4) ai(5) 

"2(3) "2(4) "2(5) 

a3(3) a3(4) a3(5) 

a4(3) a4(4) a4(5) 

"5(3) a5(4) a5(5) 



Figure 2.1: The proof of Theorem 2.4.3 is called the "diagonal argument'" because 
it constructs a new number z by working down the main diagonal of the array shown 
above, making sure z{n) 7^ a„(n) for each n G N. 



Therefore, 



1\ , 2a 1 , 1 1 

- =a^H h^=a^ + - 2a+- 

n J n n'^ n \ n 



contradicts the fact that a = lubS". Therefore, a^ > 2. 
Next, assume a^ > 2. In this case, choose n e N so that 



n 2a 

Then 



1 a^ - 2 2a 

0 < - < — =^ 0 < — < a 



2 



\ 2 

1 \ 9 2a 1 9 2a 9 / 9 „n ^ 
= a^ h ^ > a^ > a^ - a^ - 2) = 2, 

again contradicts that a = lub S*. 

Therefore, a^ = 2. □ 

Theorem 2.3.1 leads to the obvious question of how much bigger M is than Q. 
First, note that since N C Q, it is clear that card (Q) > Hg. On the other hand, 
every g G Q has a unique reduced fractional representation q = m{q)/n{q) with 
m{q) G Z and n{q) S N. This gives an injective function f : Q Z xN defined 
by fiq) — {m{q),n{q)), and we conclude card (Q) < card (Z x N) = Hq. The 
following theorem ensues. 



Theorem 2.4.2. card((n)) = H 



0- 



In 1874, Georg Cantor first showed that M is not countable. The following 
proof is his famous diagonal argument from 1891. 

Theorem 2.4.3. card (M) > Hq 
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Proof. It suffices to prove that card ([0, 1]) > Kq. If this is not true, then there 
is a bijection a : N — )■ [0, 1]; i.e., 



Each X G [0, 1] can be written in the decimal form x = /lO" where 

x{n) G {0,1,2,3,4,5,6,7,8,9} for each n e N. This decimal representation is 
not necessarily unique. For example. 



In such a case, there is a choice of x{n) so it is constantly 9 or constantly 0 from 
some N onward. When given a choice, we will always opt to end the number 
with a string of nines. With this convention, the decimal representation of x is 
unique. 

Define z G [0, 1] by choosing z{n) G {d G w : d < 8} such that z{n) ^ a„(n). 
Let z = z{n) /lO". Since z G [0, 1], there is an n G N such that z = a{n). 

But, this is impossible because z{n) differs from q;„ in the nth decimal place. 
This contradiction shows card ([0, 1]) > Kq. □ 

Around the turn of the twentieth century these then- new ideas about infinite 
sets were very controversial in mathematics. This is because some of these ideas 
are very unintuitive. For example, the rational numbers are a countable set 
and the irrational numbers are uncountable, yet between every two rational 
numbers are an uncountable number of irrational numbers and between every 
two irrational numbers there are a countably infinite number of rational numbers. 
It would seem there are either too few or too many gaps in the sets to make this 
possible. Such a seemingly paradoxical situation flies in the face of our intuition, 
which was developed with flnite sets in mind. 

This brings us back to the discussion of cardinalities and the Continuum 
Hypothesis at the end of Section 1.5. Most of the time, people working in 
real analysis assume the Continuum Hypothesis is true. With this assumption 
and Theorem 2.4.3 it follows that whenever ^ c M, then either card (A) < Hq 
or card (yl) = card (M) = card (CP(N)).^ Since CP(N) has many more elements 
than N, any countable subset of M is considered to be a small set, in the sense 
of cardinality, even if it is infinite. This works against the intuition of many 
beginning students who are not used to thinking of Q, or any other infinite set 
as being small. But it turns out to be quite useful because the fact that the 
union of a countably infinite number of countable sets is still countable can be 
exploited in many ways.^ 

In later chapters, other useful small versus large dichotomies will be found. 

^Sincc K() is the smallest infinite cardinal, Ki is used to denote the smallest uncountable 
cardinal. You will also see card (M) = c, where c is the old-style German letter c, standing 
for the "cardinality of the continuum." Assuming the continuum hypothesis, it follows that 
Ko < Ki = c. 

''See Problem 22 on page 1-21. 
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2.5 Exercises 

2.1. Prove that if a, & G F, where F is a field, then (— a)6 = —{ab) = a{—b). 

2.2. Prove Corollary 2.2.4. If a > 0, then so is a~^. If o > 0, then so is a~^. 

2.3. Prove |a;| < y iff —y < x < y. 

2.4. If 5 C R is bounded above, then 



2.5. Prove there is no set P C Z3 which makes Z3 into an ordered field. 

2.6. If a is an upper bound for S and a G S, then a = lub S. 

2.7. Let A and B be subsets of M that are bounded above. Define A + B = 
{a + b: ae AAbe B}. Prove that lub {A + B) = lub A + luhB. 

2.8. If A c Z is bounded below, then A has a least element. 

2.9. If F is an ordered field and a £ ¥ such that 0 < a < e for every £ > 0, 
then a = 0. 

2.10. Let xeM. Prove |x| < e for all e > 0 iff x = 0. 

2.11. If p is a prime number, then the equation = p has no rational 



2.12. If p is a prime number and £ > 0, then there are x,y € Q such that 
x'^ < p < y'^ < x"^ + s. 

2.13. If a < 6, then (a,6) nQ ^ 0. 

2.14. If 9 e Q and a e K \ Q, then g + a e K \ Q. Moreover, if g ^ 0, then 
aqeR\Q. 

2.15. Prove that if o < 6, then there is a g e Q such that a < \plq < b. 

2.16. Prove Corollary 2.3.8. 

2.17. If F is an ordered field and xi,X2, ■ ■ ■ ,Xn € F for some n G N, then 



lub S = gib {a; : a; is an upper bound for S}. 



solutions. 



n n 




(2.5) 



i=l i=l 



2.18. Prove Corollary 2.3.5. 



ji 
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2.19. Prove card (Q^) = c. 

2.20. If ^ c R and B = {a; : a; is an upper bound for A}, then lub(^) = 
glb(B). 
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Chapter 3 

Sequences 



3.1 Basic Properties 

Definition 3.1.1. A sequence is a function a : N K. 

Instead of using the standard function notation of a{n) for sequences, it is 
usually more convenient to write the argument of the function as a subscript, 

a„. 

Example 3.1.1. Let the sequence a„ = 1 — 1/n. The first three elements are 
ai = 0,02 = 1/2, as = 2/3, etc. 

Example 3.1.2. Let the sequence 6„ = 2". Then bi =2, 62 = 4, 63 = 8, etc. 

Example 3.1.3. If a and r are constants, then a sequence given by ci = a, 
C2 = ar, C3 = ar^ and in general c„ = ar"""'^ is called a geometric sequence. 
The number r is called the ratio of the sequence. Staying away from the trivial 
cases where a = Oorr = 0, a geometric sequence can always be recognized by 
noticing that ^^^^ = r for all n e N. Example 3.1.2 is a geometric sequence 
with a = r = 2. 

Example 3.1.4. If a and d are constants, then a sequence of the form dn = 
a + (n — l)d is called an arithmetic sequence. Another way of looking at this is 
that dn is an arithmetic sequence if (i„+i — d„ = d for all n € N. 

Example 3.1.5. Some sequences are not defined by an explicit formula, but are 
defined recursively. This is an inductive method of definition in which successive 
terms of the sequence arc defined by using other terms of the sequence. The most 
famous of these is the Fibonacci sequence. To define the Fibonacci sequence, 
/„, let /i = 0, /2 = 1 and for n > 2, let /„ = fn-2 + fn-i- The first few terms 
are 0, 1, 1, 2,3,5,8,.... There actually is a simple formula that directly gives 
fn, but we leave its derivation as Exercise 3.1.5. 

It's often inconvenient for the domain of a sequence to be N, as required by 
Definition 3.1.1. For example, the sequence beginning 1, 2, 4, 8, . . . can be writ- 
ten 2°, 2^, 2^, 2'^, Written this way, it's natural to let the sequence function 



3-1 



3-2 



CHAPTERS. SEQUENCES 



be 2" with domain lo. As long as there is a simple substitution to write the 
sequence function in the form of Definition 3.1.1, there's no reason to adhere to 
the letter of the law. In general, the domain of a sequence can be any set of the 
form {n e Z : n > for some A'' S Z. 

Definition 3.1.2. A sequence a„ is hounded if {a„ : n <E N} is a bounded set. 
This definition is extended in the obvious way to hounded ahove and hounded 
below. 

The sequence of Example 3.1.1 is bounded, but the sequence of Example 
3.1.2 is not, although it is bounded below. 

Definition 3.1.3. A sequence a„ converges to L e R if for all £ > 0 there exists 
an A'' G N such that whenever n > N, then |a„ — L| < £. If a sequence does not 
converge, then it is said to diverge. 

When a„ converges to L, we write lim„^oo = L, or often, more simply, 
a„ L. 

Example 3.1.6. Let a„ = 1 — 1/n be as in Example 3.1.1. We claim a„ — )■ 1. To 
see this, let e > 0 and choose A e N such that 1/N < e. Then, if n > A 

|a„ - 1| = 1(1 - 1/n) - 1| = 1/n < 1/N < s, 

so a„ — )• 1. 

Example 3.1.7. The sequence 6„ = 2" of Example 3.1.2 diverges. To sec this, 
suppose not. Then there is an L G M such that 6„ — >■ L. If £ = 1, there must be 
an A e N such that |6„ - L\ < e whenever n> N. Choose n> N. |i - 2"| < 1 
implies L < 2" + 1. But, then 

b„+i -L = 2"+i -L> 2"+^ - (2" + 1) = 2" - 1 > 1 = £. 

This violates the condition on A. We conclude that for every L e M there 
exists an £ > 0 such that for no A e N is it true that whenever n > A, then 
\bn — L\ < s. Therefore, 6„ diverges. 

Definition 3.1.4. A sequence a„ diverges to oo if for every B > 0 there is an 
A G N such that n> N implies a„ > B. The sequence a„ is said to diverge to 
— oo if — a„ diverges to oo. 

When a„ diverges to oo, we write lim„^oo «n = oo, or often, more simply, 
a„ — >■ oo. 

A common mistake is to forget that a„ — >■ oo actually means the sequence 
diverges in a particular way. Don't be fooled by the suggestive notation into 
treating oo as a number! 

Example 3.1.8. It is easy to prove that the sequence a„ = 2" of Example 3.1.2 

diverges to oo. 

Theorem 3.1.1. //a„ — >■ L, then L is unique. 



June 2, 2014 



http;/ /math.louisville.cdu/r^loc/ira 



3.1. BASIC PROPERTIES 



3-3 



Proof. Suppose a„ — > Li and a„ — > L2. Let e > 0. According to Definition 
3.1.2, there exist Ni,N2 € N such that n > Ni imphes |a„ — Li| < e/2 and 
n> N2 impUes |a„ - L^] < e/2. Set N = max{iVi, N2}. Iin> N, then 

\Li - L2I = 1^1 - an + an - ^2! < \Li - a„| + |a„ - L2I < e/2 + e/2 = e. 

Since s is an arbitrary positive number an apphcation of Problem 10 shows 

Li=L2. □ 

Theorem 3.1.2. Un L iff for all e > 0, the set {n : a„ ^ (L — e, L + e)} is 
finite. 

Proof. (=>) Let e > 0. According to Definition 3.1.2, there is an G N such that 
{a„ : n > A'} c {L-e,L + e). Then {n : a„ ^ {L-e,L + e)} C {1, 2, . . . , A'- 1}, 
which is finite. 

('^=) Let e > 0. By assumption {n : a„ ^ (L — e, L + e)} is finite, so let 
A' = max{n : a„ ^ (L — e, L + e)} + 1. li n> N, then Un G {L — e, L + e). By 
Definition 3.1.2, Un ^ L. □ 

Corollary 3.1.3. //a„ converges, then an is bounded. 

Proof Suppose a„ — >■ L. According to Theorem 3.1.2 there axe a finite number 
of terms of the sequence lying outside {L — 1,L + 1). Since any finite set is 
bounded, the conclusion follows. □ 

The converse of this theorem is not true. For example, a„ = ( — 1)" is 
bounded, but does not converge. The main use of Corollary 3.1.3 is as a quick 
first check to see whether a sequence might converge. It's usually pretty easy 
to determine whether a sequence is bounded. If it isn't, it must diverge. 

The following theorem lets us analyze some complicated sequences by break- 
ing them down into combinations of simpler sequences. 

Theorem 3.1.4. Let an and bn be sequences such that a„ — >■ A and bn — >■ B. 
Then 

(a) an + bn^ A + B, 

(b) a„6„ — > AB , and 

(c) ttn/bn — ^ A/B as long as bn y^O for alln £N and B ^ 0. 

Proof. (a) Let e > 0. There are Ni, N2 G N such that n > Ni implies |a„ — 
A\ < e/2 and n > N2 implies |6„ - B\ < e/2. Define A^ = max{Afi, ATa}. 
If n > A', then 

\ia„ + bn) - {A + B)\ < \an - A\ + \bn - B\ <e/2 + e/2 = e. 
Therefore an + bn^ A + B. 
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1 1 




B-bn 


< 


B^e/2 






b„B 


{B/2)B 



(b) Let e > 0 and a > 0 be an upper bound for |a„|. Choose Ni,N2 € N such 
thatn>iVi =^ \a„-A\ < e/2{\B\+l) a-n.dn> N2 =4> \bn-B\ < s/2a. 
Un>N = max{iVi, N2}, then 

|a„6„ - AB\ = |a„6„ - a„B + a„S - AB\ 

< \anbn - anB\ + \anB - AB\ 
= \an\\bn - B\ + \\B\\an - A\ 

< s/2 + s/2 = s. 

(c) First, notice that it suffices to show that 1/B, because part (b) of 
this theorem can be used to achieve the full result. 

Let £ > 0. Choose N gN so that the following two conditions are satisfied: 
n>N^ \bn\ > \B\/2 and |6„ - B\ < B^s/2. Then, when n>N, 

Therefore l/6„ 1/B. 

□ 

If you're not careful, you can easily read too much into the previous theorem 
and try to use its converse. Consider the sequences a„ = (—1)" and 6„ = — a„. 
Their sum, a„ + 6„ = 0, product fln^n = —1 and quotient ttn/bn = —1 all 
converge, but the original sequences diverge. 

It is often easier to prove that a sequence converges by comparing it with a 
known sequence than it is to analyze it directly. For example, a sequence such 
as o„ = sin^ n/n^ can easily be seen to converge to 0 because it is dominated 
by The following theorem makes this idea more precise. It's called the 

Sandwich Theorem here, but is also called the Squeeze, Pinching, Pliers or 
Comparison Theorem in different texts. 

Theorem 3.1.5 (Sandwich Theorem). Suppose a„, b„ and Cn are sequences 
such that ttn <bn < Cn for all n gN. 

(a) If Cn ^ L and Cn L, then bn — L. 

(b) Ifbn — >■ 00, then Cn 00. 

(c) Ifbn —00, then an —00. 

Proof, (a) Let e > 0. There is an TV G N large enough so that when n > N, 
then L — e < an and c„ < L + e. These inequalities imply L — s < Un < 
bn < Cn < L + e. Theorem 3.1.2 shows c„ — > L. 

(b) Let B > 0 and choose TV € N so that n > N => bn > B. Then 
Cn>bn>B whenever n> N. This shows c„ — )• 00. 

(c) This is essentially the same as part (b). 

□ 



June 2, 2014 



http:/ /math.louisville.cdu/r^loc/ira 



3.2. MONOTONE SEQUENCES 



3-5 



3.2 Monotone Sequences 

One of the problems with using the definition of convergence to prove a given 

sequence converges is the Hmit of the sequence must be known in order to verify 
the sequence converges. This gives rise in the best cases to a "chicken and egg" 
problem of somehow determining the hmit before you even know the sequence 
converges. In the worst case, there is no nice representation of the limit to 
use, so you don't even have a "target" to shoot at. The next few sections are 
ultimately concerned with removing this deficiency from Definition 3.1.2, but 
some interesting side-issues are explored along the way. 
Not surprisingly, we begin with the simplest case. 

Definition 3.2.1. A sequence a„ is increasing, if a„+i > a„ for all n e N. It 
is strictly increasing if a„+i > a„ for all n € N. 

A sequence a„ is decreasing, if a„+i < a„ for all n e N. It is strictly 
decreasing if a„+i < a„ for all n G N. 

If a„ is any of the four types listed above, then it is said to be a monotone 
sequence. 

Notice the < and > in the definitions of increasing and decreasing sequences, 
respectively. Many calculus texts use strict inequalities because they seem to 
better match the intuitive idea of what an increasing or decreasing sequence 
should do. For us, the non-strict inequalities are more convenient. 

Tiieorem 3.2.1. A bounded monotone sequence converges. 

Proof. Suppose a„ is a bounded increasing sequence, L = lub {a„ : n e N} and 
e > 0. Clearly, a„ < L for all n e N. According to Theorem 2.3.3, there 

exists an G N such that qn > L — e. Because the sequence is increasing, 
L > On > > L ~ £ ior all n > N. This shows a„ — t- L. 

If o„ is decreasing, let 6„ = — a„ and apply the preceding argument. □ 

The key idea of this proof is the existence of the least upper bound of the 
sequence when viewed as a set. This means the Completeness Axiom implies 
Theorem 3.2.1. In fact, it isn't hard to prove Theorem 3.2.1 also implies the 
Completeness Axiom, showing they are equivalent statements. Because of this. 
Theorem 3.2.1 is often used as the Completeness Axiom on M instead of the 
least upper bound property we used in Axiom 8. 

Example 3.2.1. The sequence e„=(l-|-^)" converges. 

Looking at the first few terms of this sequence, ei = 2, 62 = 2.25, 63 « 2.37, 
64 « 2.44, it seems to be increasing. To show this is indeed the case, fix n € N 
and use the binomial theorem to expand the product as 




(3.1) 
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and 



fe=o 

For 1 < fc < n, the fcth term of (3.1) is 



n\ 1 n(n - l)(n - 2) • • • (n - (fc - 1)) 



In— In — 2 n— fc + 1 

k\ n n n 



fc! V n 



n / \ n 



fc!\ n + I J \ n + I J \ n + 1 
(n + l)n(n - l)(n - 2) • • • (n + 1 - (fc - 1)) 
fc!(n + 1)'= 

n + 1\ 1 



fc Jin + 1)''' 

which is the fcth term of (3.2). Since (3.2) also has one more positive term in 
the sum, it follows that e„ < e„+i, and the sequence e„ is increasing. 
Noting that 1/fc! < 1/2'=-^ for fc e N, equation (3.1) implies 

n\ 1 n! 1 



kj nf' k\{n — k)\ 

n— In — 2 n — fc + 11 
n n n k\ 

1 

<fc! 

1 

- W^' 



Substituting this into (3.1) yields 

1 

k=0 



E 



kin 



k 



<1 + 1+- + - + --- + 



2 4 2"-i 

1 - 



= 1 + — ^<3 

^ 2 



so e„ is bounded. 

Since e„ is increasing and bounded, Theorem 3.2.1 implies e„ converges. 
Of course, you probably remember from your calculus course that e„ — >■ e « 
2.71828. 
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3.3. SUBSEQUENCES AND THE BOLZANO- WEIERSTRASS THEORES-iT 



Theorem 3.2.2. An unbounded monotone sequence diverges to oo or — oo, de- 
pending on whether it is increasing or decreasing, respectively. 

Proof. Suppose a„ is increasing and unbounded. If i? > 0, the fact that a„ 
is unbounded yields a,n N G N such that ajv > B. Since a„ is increasing, 

o-n ^ In > B for all n > N. This shows a„ — > oo. 

The proof when the sequence decreases is similar. □ 

3.3 Subsequences and the Bolzano- Weierstrass 
Theorem 

Definition 3.3.1. Let a„ be a sequence and ct : N — >■ N be a function such that 
m < n implies a{m) < cr(n); i.e., a is strictly increasing sequence of natural 
numbers. Then 6„ = a o a{n) = a^(n) is a subsequence of a„. 

The idea here is that the subsequence 6„ is a new sequence formed from an 

old sequence a„ by possibly leaving terms out of a„. In other words, we see 
all the terms of 6„ must also appear in a„, and they must appear in the same 
order. 

Example 3.3.1. Let a{n) = 3n and a„ be a sequence. Then the subsequence 
a^(^n) looks like 

«3; ^6) Cg, . . . , asn, ■ ■ ■ 

The subsequence has every third term of the original sequence. 
Example 3.3.2. If a„ = sin(n7r/2), then some possible subsequences are 

bn = Ctin+l =^ bn = 1, 

and 

d„ = a„2 =^ rf„ = (1 + (-l)"+i)/2. 

Theorem 3.3.1. «„ L iff every subsequence of converges to L. 

Proof. (=>) Suppose a : N — >■ N is strictly increasing, as in the preceding defini- 
tion. With a simple induction argument, it can be seen that (T{n) > n for all n. 
(See Exercise 3.8.) 

Now, suppose a„ ^ L and 6„ = a„(^n) is a subsequence of a„. If e > 0, there 
is an A/' e N such that n> N implies a„ e {L — e,L + e). From the preceding 
paragraph, it follows that when n> N, then 6„ = ao-(n) = (^m for some m > n. 
So, bn & {L — e,L + e) and 6„ — >■ L. 

{<=) Since a„ is a subsequence of itself, it is obvious that an ^ L. □ 

The main use of Theorem 3.3.1 is not to show that sequences converge, but, 
rather to show they diverge. It gives two strategies for doing this: find two 
subsequences converging to different limits, or find a divergent subsequence. In 
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Example 3.3.2, the subsequences 6„ and c„ demonstrate the first strategy, while 
dn demonstrates the second. 

Even if the original sequence diverges, it is possible there are convergent 
subsequences. For example, consider the divergent sequence a„ = (—1)". In 
this diverges, but the two subsequences a2n = 1 and a2„+i = — 1 are 

constant sequences, so they converge. 

Theorem 3.3.2. Every sequence has a monotone subsequence. 

Proof. Let a„ be a sequence and T = {n G N : m > n > There 

are two cases to consider, depending on whether T is finite. 

First, assume T is infinite. Define (t(1) = minT and assuming cr(n) is 
defined, set a{n + 1) = minT \ {a{l),a{2), . . . , <j{n)}. This inductively defines 
a strictly increasing function cr : N — )• N. The definition of T guarantees a„(^n) 
is an increasing subsequence of a„. 

Now, assume T is finite. Let a(l) = maxT + 1. If a{n) has been chosen for 
some n > maxT, then the definition of T implies there is an rn > a{n) such that 
flm !i ia(n)- Set (T(n + 1) = m. This inductively defines the strictly increasing 
function a : N — >■ N such that a<^(„) is a decreasing subsequence of a„. □ 

If the sequence in Theorem 3.3.2 is boimded, then the corresponding mono- 
tone subsequence is also bounded. Recalling Theorem 3.2.1, we see the following. 

Theorem 3.3.3 (Bolzano- Weierstrass). Every bounded sequence has a conver- 
gent subsequence. 

3.4 Lower and Upper Limits of a Sequence 

There are an uncountable number of strictly increasing functions cr ; N N, so 
every sequence a„ has an uncountable number of subsequences. If a„ converges, 
then Theorem 3.3.1 shows all of these subsequences converge to the same limit. 
It's also apparent that when a„ — >■ oo or a„ — >■ — oo, then all its subsequences 
diverge in the same way. When a„ does not converge, or diverge to ±oo, the 
situation is a bit more difficult. 

Example 3.4.1. Let Q = {qn : n E N} and a G M. Since every interval contains 
an infinite number of rational numbers, it is possible to choose cr(l) = min{fc : 
\qk — a\ < 1}. In general, assuming a{n) has been chosen, choose a{n -I- 1) = 
min{fc > a{n) : \qk — a\ < 1/n}. Such a choice is always possible because 
Q n (a — 1/n, a + 1/n) \ {qk : k < cr(n)} is infinite. This induction yields a 
subsequence qa(n) of converging to a. 

If a„ is a sequence and 6„ is a convergent subsequence of o„ with 6„ — > L, 
then L is called an accumulation point of a„. A convergent sequence has only 
one accumulation point, but a divergent sequence may have many accumulation 
points. As seen in Example 3.4.1, a sequence may have all of K as its set of 
accumulation points. 
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To make some sense out of this, suppose a„ is a bounded sequence, and 
Tn = {ofe '■ k > n}. Define 

in = glbT„ and ;ti„ = lubT„. 

Because T„ D T„+i, it follows that for all n G N, 

^1 < 4 < ^n+l < Mn+l < < Ml- (3.3) 

Theorem 3.2.1 implies both in and /i„ converge. If in ^ i and /i„ — > /x, (3.3) 
shows for all n, 

4 < ^ < M < Mn. (3.4) 

Suppose 6„ /3 is any convergent subsequence of a„. From the definitions 
of in and /U„, it is seen that in ^ bn < Hn for all n. Now (3.4) shows i < (3 < 
The normal terminology for i and yit is given by the following definition. 

Definition 3.4.1. Let a„ be a sequence. If a„ is bounded below, then the lower 
limit of a„ is 

liminfa„ = lim glbjafe : k > n}. 
If a„ is bounded above, then the upper limit of a„ is 

lim sup a„ = lim lub{afc : k>n}. 

When ttn is unbounded, the lower and upper limits are set to appropriate infinite 
values, while recalling the familiar warnings about oo not being a number. 

The next theorem summarizes the observations presented above. 

Theorem 3.4.1. Let a„ be a sequence. 

(a) lim inf a„ < lim sup a„ . 

(b) If bn is any convergent subsequence of a„, then 

lim inf o„ < lim 6„ < lim sup a„. 

n— >cx) 

(c) lim inf a„ = lim sup a„ iff an converges. 

Suppose that a„ is bounded above and both /i„ and /i arc as in the discussion 
preceding the definition. Choose cr(l) so 0^(1) > /xi — 1. If cr(n) has been chosen 
for some n gN, then choose a{n + 1) > cr(n) to satisfy 

> a<7(n+i) > lub {ttfc : k > a{n)} - 1/n = u„(^n) - 

This inductively defines a subsequence a„(^n) ~^ where the convergence is 

guaranteed by Theorem 3.1.5, the Sandwich Theorem. A similar argument can 
be made for lim inf a„. The following theorem has been proved. 

Theorem 3.4.2. Let a„ be a sequence. 

(a) If j3 is an accumulation point of an, then lim inf a„ < /3 < lim sup a„. 

(b) There are subsequences of an converging to lim inf o„ and lim sup a„ . There- 
fore, whenever they are finite, lim inf a„ and lim sup a„ are accumulation 
points of On- 
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3.5 The Nested Interval Theorem 

Definition 3.5.1. A collection of sets {5„ : n e N} is said to be nested, if 
Sn+i C S„ for all n e N. 

Theorem 3.5.1 (Nested Interval Theorem). If {In — [dn^bn] : n G N} is a 
nested collection of closed intervals such that lim„_).oo (&n — dn) = 0, then there 
is an X gM. such that O^eN ~ {^}- 

Proof. Since the intervals are nested, it's clear that a„ is an increasing sequence 
bounded above by hi and is a decreasing sequence bounded below by a^. 
Applying Theorem 3.2.1 twice, we find there are a, /? e M such that a„ ^ a and 

We claim a = ji. To see this, let e > 0 and use the "shrinking" condition 

on the intervals to pick iV G N so that — aw < £■ The nestedness of the 
intervals implies on < On < bn < bj^i for all n > N. Therefore 

ajv < lub {an ■ n > N} = a <bN and ajv < gib {6„ : n > N} = (3 < 6jv 

This shows ja — ^| < |6jv — ajv| <£• Since e > 0 was chosen arbitrarily, we 

conclude a = /3. 

Let X = a = fi. It remains to show that HneN = {^}- 

First, we show that x € HneN^"- this, fix TV € N. Since a„ increases 

to X, it's clear that x > qn- Similarly, x < 6jv- Therefore x £ [un, 6jv]- Because 

N was chosen arbitrarily, it follows that x € HneN-^"' 

Next, suppose there are x, y G HneN and let e > 0. Choose N such 

that 6jv — cijv < s- Then {a;, y} C PlneN C [un, b]\f] implies |a; — y| < e. Since 

e was chosen arbitrarily, we see x = y. Therefore PlneN = {x}- □ 

Example 3.5.1. If /„ = (0, 1/n] for all n € N, then the collection {/„ : n G N} 
is nested, but HneN-^" ~ ^- "^^^^ shows the assumption that the intervals be 
closed in the Nested Interval Theorem is necessary. 

Example 3.5.2. If /„ = [n, oo) then the collection {/„ : n G N} is nested, but 
PlneN-^" ~ ^- T^^^s shows the assumption that the lengths of the intervals 
be bounded is necessary. (It will be shown in Corollary 5.1.7 that when their 
lengths don't go to 0, then the intersection is nonempty, but the uniqueness of 
X is lost.) 



3.6 Cauchy Sequences 

Often the biggest problem with showing that a sequence converges using the 
techniques we have seen so far is that we must know ahead of time to what it 
converges. This is the "chicken and egg" problem mentioned above. An escape 
from this dilemma is provided by Cauchy sequences. 

"'^June 2, 2014 ©Lee Larson (LLarson@LouisvilIe.edu) 
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Definition 3.6.1. A sequence a„ is a Cauchy sequence if for all e > 0 there is 

an TV G N such that n,m> N implies |a„ — am\ < £■ 

This definition is a bit more subtle than it might at first appear. It sort 
of says that all the terms of the sequence are close together from some point 
onward. The emphasis is on all the terms from some point onward. To stress 
this, first consider a negative example. 

Example 3.6.1. Suppose a„ = J2k=i 1/^ for n € N. There's a trick for showing 
the sequence a„ diverges. First, note that a„ is strictly increasing. For any 
n e N, consider 



1 ^ n-l 2^-1 ^ 



X! /L^ ^ 20 +k 

fe=l j=o fe=0 

ri-l 2'-l ri-1 1 

Ex ^ i -i ^ i n 

j=o fe=0 j=o 

Hence, the subsequence -i is unbounded and the sequence a„ diverges. (To 
see how this works, write out the first few sums of the form -1.) 

On the other hand, |a„+i — a„| = l/(n + 1) — >■ 0 and indeed, if m is fixed, 
|a„+m — Sn\ 0. This makes it seem as though the terms are getting close 
together, as in the definition of a Cauchy sequence. But, s„ is not a Cauchy 
sequence, as shown by the following theorem. 

Theorem 3.6.1. A sequence converges iff it is a Cauchy sequence. 

Proof. (^) Suppose an ^ L and e > 0. There is an A'' e N such that n > N 
imphes |a„ — L\ < e/2. If m,n > N, then 

\am - an\ = \am - L + L - a„| < \am - L\ + \L - a^l < e/2 + e/2 = e. 

This shows a„ is a Cauchy sequence. 

(<^=) Let a„ be a Cauchy sequence. First, we claim that a„ is bounded. To 
see this, let e = 1 and choose N N such that n.m > N implies |a„ — a„i| < 1. 
In this case, ajv — 1 < (in < aw + 1 for all n> N, so {a„ : n > N} is a bounded 
set. The set {a„ : n < N}, being finite, is also bounded. Since {a„ : n e N} is 
the union of these two bounded sets, it too must be bounded. 

Because a„ is a bounded sequence. Theorem 3.3.3 implies it has a convergent 
subsequence 6„ = 0^(71) ~^ Let e > 0 and choose N G N so that n,m > N 
implies |o„ — am\ < e/2 and |6„ — L\ < e/2. U n> N, then a{n) > n> N and 

|a„ - L\ = |a„ -bn + bn- L\ 

< |a„ - bn\ + \bn - L\ 

= |a„ - a^(„)| + \bn - L\ 

< e/2 + e/2 = e. 

Therefore, Un ^ L. □ 



http:// math.louisville.edu/ --^lee/ir a 



3-12 



CHAPTERS. SEQUENCES 



The fact that Cauchy sequences converge is yet another equivalent version 
of completeness. In fact, most advanced texts define completeness as "Cauchy 
sequences converge." This is convenient in general spaces because the definition 
of a Cauchy sequence only needs the metric on the space and none of its other 
structure. 

A typical example of the usefulness of Cauchy sequences is given below. 

Definition 3.6.2. A sequence a;„ is contractive if there is a c e (0, 1) such that 
\xk+i — Xk\ < c\xk — Xk-i\ for all A; > 1. c is called the contraction constant. 

Theorem 3.6.2. If a sequence is contractive, then it converges. 

Proof. Let be a contractive sequence with contraction constant c € (0, 1). 
We first claim that if n e N, then 

- < c"~^|xi - a;2|. (3.5) 

This is proved by induction. When n = l, the statement is 

— < c'^lxi — X2I = \xi — X2I, 

which is trivially true. Suppose that |x„ — a;„_|_i| < c"~^\xi — X2I for some 
n e N. Then, from the definition of a contractive sequence and the induction 
hypothesis, 

l^^n+i - Xn+2\ < c|x„ - x„+i| < c (c""^|.Ti - X2\) c"|a;i | . 

This shows the claim is true in the case n + 1. Therefore, by induction, the 
claim is true for all n € N. 

To show Cauchy sequence, let £ > 0. Since c" 0, we can choose 

A'' G N so that 



(1-c) 

Let n > m > N. Then 



|a;i - X2\ < e. (3.6) 



\Xn Xiji\ — \Xji Xji—i -\- Xji—i Xji—2 ~t~ Xji^2 ' ' ' '^m+l ~t~ ^ri 

< \Xn - Xn-l\ + \Xn-l " Xn-2\ H h \Xm+l " Xm\ 

Now, use (3.5) on each of these terms. 

< c"-2|a;i - X2\ + c"-^|a;i -X2\ + ---+ c"" - X2\ 
= \xi - a;2|(c"-' + c"-3 + • • • + c™-i) 

Apply the formula for a geometric sum. 

= \X^-X2\C'^-^ 



1-C 



<\xi-X2\y:^ (3-7) 
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Use (3.6) to estimate the following. 



■ X2\ 



< \Xi - X2\ 



1-c 

£ 

\xi — X2\ 



This shows ITy^ IS 3j Cauchy sequenceand must converge by Theorem 3.6.1. □ 

Example 3.6.2. Let — 1 < r < 1 and define the sequence s„ = X^^^o'''^- (You 
no doubt recognize this as the geometric series from your calculus course.) If 
r = 0, the convergence of s„ is trivial. So, suppose r 7^ 0. In this case, 

n+l 



\Sn+l 



r < 1 



and Sn is contractive. Theorem 3.6.2 implies s„ converges. 

Example 3.6.3. Suppose f{x) = 2 + 1/a;, ai = 2 and fln+i = /((Xn) for n e N. 
It is evident that a„ > 2 for all n. Some algebra gives 



— fin 




/(/(an-i)) 


- /(a«-i) 


1 


^n— 1 




./(On-l) 




1 + 2a„_i 



1 



< 



This shows a„ is a contractive sequence and, according to Theorem 3.6.2, a„ — >■ 
L for some L > 2. Since, a„+i = 2 + l/o„, taking the limit as n ^ 00 of both 
sides gives _L = 2 + 1/L. A bit more algebra shows L = 1 + \pl. 

L is called & fixed point of the function /; i.e. /(i) = L. Many approximation 
techniques for solving equations involve such iterative techniques depending 
upon contraction to find fixed points. 

The calculations in the proof of Theorem 3.6.2 give the means to approximate 
the fixed point to within an allowable error. Looking at line (3.7), notice 



xXr, 



Xr, 



< \X\ — X2\ 



Let n 



1-c 

00 in this inequality to arrive at the error estimate 

1 



X'n 



2^2 



1 



(3.8 



In Example 3.6.3, ai = 2, 02 = 5/2 and c < 1/5. Suppose we want to 
approximate L to 5 decimal places of accuracy. This means we need |a„ — L| < 
5 X 10~^. Using (3.8), with m = 9 shows 



|Oi - a2\ 



1 



< 1.6 X 10"*^. 



Some arithmetic gives 0,9 w 2.41421. The calculator value of L 
2.414213562, confirming our estimate. 
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3.7 Exercises 

Stt- — 1 

3.1. Let the sequence a„ = . Use the definition of convergence for a 

3n + 2 

sequence to show a„ converges. 

3.2. If a„ is a sequence such that — L and 0271+1 — )• L, then an ^ L. 

3.3. Let a„ be a sequence such that a2n — ^ and 02n — a2n-i ^ 0. Then 
a„ A. 

3.4. If a„ is a sequence of positive numbers converging to 0, then 0. 

3.5. Find examples of sequences a„ and 6„ such that a„ — )• 0 and 6„ — )• 00 
such that 

(a) ttnbn 0 

(b) a„6„ ^ 00 

(c) lim„_>.oo cin^n does not exist, but a„&„ is bounded. 

(d) Given c G M, a„6„ — >■ c. 

3.6. If Xn and are sequences such that Um„^oo Xn = L ^ 0 and Um„^oo s^nj/n 
exists, then hm„^oo Un exists. 

3.7. Determine the limit of a„ = \/n!. (Hint: If n is even, then n! > (n/2)"/^.) 

3.8. If cr : N — >■ N is strictly increasing, then a{n) > n for all n e N. 

3.9. Every unbounded sequence contains a monotonic subsequence. 

3.10. Find a sequence a„ such that given x e [0, 1], there is a subsequence 6„ 

of a„ such that b„ x. 

3.11. A sequence a„ converges to 0 iff |a„| converges to 0. 

3.12. Define the sequence a„ = for n € N. Show that |a„+i — a„| 0, 
but a„ is not a Cauchy sequence. 

3.13. Suppose a sequence is defined by ai = 0, ai = 1 and a„+i = 2(a„+a„_i) 
for n > 2. Prove a„ converges, and determine its limit. 

3.14. If the sequence a„ is defined recursively by ai = 1 and a„+i = \/an + 1, 
then show a„ converges and determine its limit. 

3.15. If a„ is a sequence such that linin^^ |a„+i/a„| = p < 1, then a„ — >■ 0. 
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3.16. Prove that the sequence a„ = n^/n! converges. 

3.17. Let a„ and 6„ be sequences. Prove that both sequences a„ and 6„ 
converge iff both a„ + 6„ and a„ — 6„ converge. 

3.18. Let a„ be a bounded sequence. Prove that given any e > 0, there is an 
interval / with length e such that {n : Un & 1} is infinite. Is it necessary that 
a„ be bounded? 

3.19. A sequence o„ converges in the mean if a„ = ^ J2k=i converges. 
Prove that if a„ L, then a„ i, but the converse is not true. 

3.20. Find a sequence a;„ such that for all n e N there is a subsequence of a;„ 
converging to n. 

3.21. If (In is a Cauchy sequence whose terms are integers, what can you say 
about the sequence? 

3.22. Show a„ = J2k=o ^/^^- ^ Cauchy sequence. 

3.23. If a„ is a sequence such that every subsequence of a„ has a further 
subsequence converging to L, then a„ L. 

3.24. If a, 6 e (0, oo), then show a/o" + 6" max{a, 6}. 

3.25. If 0 < a < 1 and s„ is a sequence satisfying |s„+i| < a|s„|, then s„ — >• 0. 

3.26. If c > 1 in the definition of a contractive sequence, can the sequence 
converge? 

3.27. If a„ is a convergent sequence and 6„ is a sequence such that ja^ — cinl > 
l&TO — for all m, n e N, then 6„ converges. 

3.28. If a„ > 0 for all n e N and a„ — >■ L, then y^o^ \fL. 

3.29. If ttTT, is a CcLUchy sequence cind. h-n is a subsequence of a„ such that 
6„ L, then a„ — )• L. 

3.30. Let a„ be a sequence. a„ — >■ L iflF limsupa„ = L = liminf a„. 

3.31. Is limsup(a„ + 6„) = limsupa„ + limsup6„? 

3.32. If a„ is a sequence of positive numbers, then liminf a„ = limsup l/a„. 

3.33. a„ = 1/n is not contractive. 

3.34. The equation — 4x + 2 = 0 has one real root lying between 0 and 1. 
Find a sequence of rational numbers converging to this root. Use this sequence 
to approximate the root to five decimal places. 
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3.35. Approximate a solution of a;^ — 5a; + 1 = 0 to within 10 ^ using a Cauchy 

sequence. 

3.36. Prove or give a counterexample: If a„ — ^ L and cr : N N is bijective, 
then bn = acr{n) converges. Note that 6„ might not be a subsequence of a„. (6„ 
is called a rearrangement of a„.) 
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Chapter 4 

Series 



Given a sequence a„, in many contexts it is natural to ask about the sum of all 
the numbers in the sequence. If only a finite number of the a„ are nonzero, this 
is trivial — and not very interesting. If an infinite number of the terms aren't 
zero, the path becomes less obvious. Indeed, it's even somewhat questionable 
whether it makes sense at all to add an infinite number of numbers. 

There are many approaches to this question. The method given below is the 
most common technique. Others are mentioned in the exercises. 



The idea behind adding up an infinite collection of numbers is a reduction to the 
well-understood idea of a sequence. This is a typical approach in mathematics: 
reduce a question to a previously solved problem. 

Definition 4.1.1. Given a sequence a„, the series having a„ as its terms is the 
new sequence 



The numbers .s„ arc called the partial sums of the series. If s„ — > 5 € M, then 
the series converges to S. This is normally written as 



Otherwise, the series diverges. 

The notation X^^i is imderstood to stand for the sequence of partial 
sums of the series with terms a„. When there is no ambiguity, this is often 
abbreviated to just X^fln. 



4.1 What is a Series? 



n 




oo 



4-1 



4-2 



Series 



Example 4.1.1. If a„ = (-1)" for n £ N, then si = -1, S2 = -1 + 1 = 0, 
S3 = — 1 + 1 — 1 = — 1 and in general 

_ (-1)"-! 

Sn - 2 

does not converge because it oscillates between —1 and 0. Therefore, the series 
1)" diverges. 

Example 4.1.2 (Geometric Series). Recall that a sequence of the form a„ = 
cr"~^ is called a geometric sequence. It gives rise to a series 

oo 

^ cr""^ = c + cr + cr^ + cr^ -\ 

n=l 

called a geometric series. The number r is called the ratio of the series. 



Suppose a„ 



for r ^ 1. Then, 



Si = 1, S2 = 1 + r, S3 = 1 + r + r , ... 
In general, it can be shown by induction (or even long division) that 

1 - r" 



(4.1) 



fc=i 



k=l 



The convergence of s„ in (4.1) depends on the value of r. Letting n — 00, it's 
apparent that s„ diverges when \r\ > 1 and converges to 1/(1 — r) when |r| < 1. 
When r = 1, s„ = n — >■ 00. When r = —1, it's essentially the same as Example 
4.1.1, and therefore diverges. In summary. 



for \r\ < 1, and diverges when \r\ > 1. This is called a geometric series with 
ratio r. 

Figure 4.1: Stepping to the wall. 



steps 



1 

distance from wall 



1/2 



In some cases, the geometric series has an intuitively plausible limit. If you 
start two meters away from a wall and keep stepping halfway to the wall, no 
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number of steps will get you to the wall, but a large number of steps will get 
you as close to the wall as you want. (See Figure 4.1.) So, the total distance 
stepped has limiting value 2. The total distance after n steps is the nth partial 
sum of a geometric series with ratio r = 1/2 and c = 1. 

Example 4.1.3 (Harmonic Series). The series Yl^=i 1/"' called the harmonic 
series. It was shown in Example 3.6.1 that the harmonic series diverges. 

Example 4.1.4. The terms of the sequence 

an = ^, — , neN. 
n'' +n 

can be decomposed into partial fractions as 

1 1 



n + 1 



If Sn is the series having a„ as its terms, then si = 1/2 = 1 — 1/2. We claim 
that s„ = 1 — l/(n + 1) for all n S N. To see this, suppose Sfe = 1 — 1/(A: + 1) 
for some fc € N. Then 

.fe+r=., + afc+i = l-^+(^-^) =1-^ 

and the claim is established by induction. Now it's easy to see that 

y = lim fl - —) = 1. 

+ n n^oo \ n + 2 I 

n=l ^ ' 

This is an example of a telescoping series. The name is apparently based on the 
idea that the middle terms of the series cancel, causing the series to collapse 
like a hand-held telescope. 

The following theorem is an easy consequence of the properties of sequences 
shown in Theorem 3.1.4. 

Theorem 4.1.1. Let ^ a„ and ^ 6„ he convergent series. 

(a) // c <E then ^con — c^On- 

(b) Y.{(^n + bn) = Y^an + Y^bn. 

(c) a„ -> 0 

Proof. Let = Yll^^i and i?„ = J2k=i ^fe sequences of partial sums 

for each of the two series. By assumption, there are numbers A and B where 
An ^ A and S„ B. 

(a) Z]fe=i ca/c = cJ2l=i dk = cAn -> cA. 

(b) J2k=li'^k + bk) = J2k=l o-k + YIk=l bk = An + Bn A + B. 

(c) For n > 1, 0„ = J2k=l °-k - Ya:=1 0-k = An- An-1 A- A = Q. □ 
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Notice that the first two parts of Theorem 4.1.1 show that the set of all 
convergent series is closed under linear combinations. 

Theorem 4.1.1(c) is most useful because its contrapositive provides the most 
basic test for divergence. 

Corollary 4.1.2 (Going to Zero Test). //a„ 7^ 0, then X^a„ diverges. 

Many have made the mistake of reading too much into Corollary 4.1.2. It 
can only be used to show divergence. When the terms of a series do tend to zero, 
that does not guarantee convergence. Example 4.1.3, shows Theorem 4.1.1(c) is 
necessary, but not sufficient for convergence. 

Another useful observation is that the partial sums of a convergent series 
are a Cauchy sequence. The Cauchy criterion for sequences can be rephrased 
for series as the following theorem, the proof of which is Exercise 4.1.3. 

Theorem 4.1.3 (Cauchy Criterion). Let X^fln be a series. The following state- 
ments are equivalent. 

(a) X^a„ converges. 

(b) For every s > 0 there is an N gN such that whenever n> m> N, then 



4.2 Positive Series 

Most of the time, it is very hard or impossible to determine the exact limit of a 
convergent series. We must satisfy ourselves with determining whether a series 
converges, and then approximating its sum. For this reason, the study of series 
usually involves learning a collection of theorems that might answer whether a 
given series converges, but don't tell us to what it converges. These theorems are 
usually called the convergence tests. The reader probably remembers a battery 
of such tests from her calculus course. There is a myriad of such tests, and the 
standard ones are presented in the next few sections, along with a few of those 
less widely used. 

Since convergence of a scries is determined by convergence of the sequence of 
its partial sums, the easiest series to study are those with well-behaved partial 
sums. Series with monotone sequences of partial sums are certainly the simplest 
such series. 

Definition 4.2.1. The series is a positive series, if a„ > 0 for all n. 

The advantage of a positive series is that its sequence of partial sums is 
nonnegative and increasing. Since an increasing sequence converges if and only 
if it is bounded above, there is a simple criterion to determine whether a positive 
series converges. All of the standard convergence tests for positive series exploit 
this criterion. 
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4.2.1 The Most Common Convergence Tests 

All beginning calculus courses contain several simple tests to determine whether 
positive series converge. Most of them are presented below. 

Comparison Tests 

The most basic convergence tests are the comparison tests. In these tests, the 
behavior of one scries is inferred from that of another series. Although they're 
easy to use, there is one often fatal catch: in order to use a comparison test, 
you must have a known series to which you can compare the mystery series. For 

this reason, a wise mathematician collects example scries for her toolbox. The 
more samples in the toolbox, the more powerful are the comparison tests. 

Theorem 4.2.1 (Comparison Test). Suppose ^ a„ and ^ 6„ are positive series 
with a„ < bn for all n. 

(a) // X] bn converges, then so does X] ■ 

(b) // ^ a„ diverges, then so does ^ 6„ . 

Proof. Let An and B„ be the partial sums of X^fln and Y^bn, respectively. It 
follows from the assumptions that An and i?„ are increasing and for all n e N, 



If ZZ^n = then (4.2) implies B is an upper bound for An, and X^a„ 
converges. 

On the other hand, if ^ a„ diverges, ^„ — >■ oo and the Sandwich Theorem 



Example 4.2.1. Example 4.1.3 shows that ^ 1/n diverges. Ifp < 1, then > 
1/n, and Theorem 4.2.1 implies ^ 1/n*' diverges. 

Example 4.2.2. The series ^sin^n/2" converges because 



for all n and the geometric series X] 1/2" = 1- 

Theorem 4.2.2 (Cauchy's Condensation Test""^). Suppose an is a decreasing 
sequence of nonnegative numbers. Then 



An<B, 



(4.2) 



3.1.5(b) shows Bn — >■ oo. 



□ 



• 2 1 

sm n 1 



2" ~ 2" 




a„ converges iff > 2"'a2n converges. 




The series 2" 02" is sometimes called the condensed series associated with ^ a, 
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ai + a2 + + + ar^ + + qy + as + ag + ■ ■ ■ + ais +aie + 



<2ao <ia. <5 



^ Oi ^ + 02 + as , + Q4 + «5 + Q6 + Q? + Q8 + Q9 + • • • + Ql5 , +Q16 + • • • 
>a2 >2a4 >4a8 >8ai6 

Figure 4.2: This diagram sliows tlie groupings used in inequality (4.3). 
Proof. Since is decreasing, for n e N, 

2"+^-l 2"-! 

^ Ofc < 2"a2. < 2 ^ ak. (4.3) 
(See Figure 4.2.1.) Adding for 1 < n < m gives 

2'"+i_l m 2""-! 

^ flfc < ^ 2^=02/= < 2 ^ flfc 

fc=2 fc=l fe = l 

and the theorem follows from the Comparison Test. □ 

Example 4.2.3 (p-series). For fixed p G M, the series ^ l/n^ is called a p-series. 
The special case when p = 1 is the harmonic series. Notice 

X/ (2")P ^ 51 (■^ ^) 

is a geometric series with ratio 2^^p, so it converges only when 2^^p < 1. Since 
2^~P < 1 only when p > 1, it follows from the Cauchy Condensation Test that 
the p-series converges when p > 1 and diverges when p < 1. (Of course, the 
divergence half of this was already known from Example 4.2.1.) 

The p-series are often useful for the Comparison Test, and also occur in many 
areas of advanced mathematics such as harmonic analysis and number theory. 

Theorem 4.2.3 (Limit Comparison Test). Suppose ^ a„ and ^ &„ are positive 
series with 

a = liminf — < limsup — ~ f3. (4-4) 

On K 

(a) IfaCz (0, 00) and ^ a„ converges, then so does ^ and if^ 6„ diverges, 
then so does ^ a„. 

(b) If /3 G (0,00) and'Y^hn diverges, then so does'Y^an, and if^On converges, 
then so does ^ 6„ . 
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Proof. To prove (a), suppose a > 0. There is an A/' e N such that 

a a 
2^ h 



n>N^^<^. (4.5) 



If n > A^, then (4.5) gives 

n n 

f E ''fc < E «fe (4.6) 

k=N k=N 

Part (a) now follows from the comparison test. 

The proof of (b) is similar. □ 

The following easy corollary is the form this test takes in most calculus books. 
It's easier to use than Theorem 4.2.3 and suffices most of the time. 

Corollary 4.2.4 (Limit Comparison Test). Suppose ^ a„ and ^ 6„ are positive 
series with 

a= lim ^. (4.7) 

n—i-oo On 

If a G (0,oo), then X^a„ and J2^n either both converge or both diverge. 
Example 4.2.4. To test the series 5^-^for convergence, let 

0"n = — and hn= 

2" - n "2" 

Then 

a„ 1/(2" - n) 2" 1 . ^ . 

lim — = iim — — = lim = lim 7— = 1 G (0, cxd). 

n^oo On rn-cx) 1/2" n^-oo 2" — n rn-oo 1 — n/2" 

Since 1/2" = 1, the original series converges by the Limit Comparison Test. 
Geometric Series- Type Tests 

The most important series is undoubtedly the geometric series. Several standard 
tests are basically comparisons to geometric series. 

Theorem 4.2.5 (Root Test). Suppose X^a„ is a positive series and 

p = lim sup ay". 

If p <1, then X]a„ converges. If p> 1, then X^a„ diverges. 

Proof. First, suppose p < 1 and r S (p, !)• There is an A'' e N so that al/" < r 
for all n > N. This is the same as a„ < r" for all n> N. Using this, it follows 
that when n> N, 



n N-1 n N-1 n 

^ ^ ak + ^ ak < ^ ttk + ^ 

fc=l fe=l fe=JV k=l k=N 



rK 
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Since X]fe=Ar^'^ is a partial sum of a geometric series with ratio r < 1, it must 

converge. 

If p > 1, there is an increasing sequence of integers fc„ — >■ oo such that 
"fef " > 1 foi" all n e N. This shows afc„ > 1 for all n e N. By Theorem 4.1.2, 
diverges. □ 

Example 4.2.5. For any x E M, the series ^ |a;"|/n! converges. To see this, note 
that according to Exercise 4.7, 



l/r, 



0 < 1. 



n! / (n!)V" 
Applying the Root Test shows the series converges. 

Example 4.2.6. Consider the p-serics and ^ The first diverges and 

the second converges. Since n^^" — > 1 and n^^" — > 1, it can be seen that when 
p = 1, the Root Test in inconclusive. 

Theorem 4.2.6 (Ratio Test). Suppose J2'^n is a positive series. Let 
r = lim mf < lim sup = R. 

If R < 1, then ^ a„ converges. Ifr> 1, then Yl^n diverges. 

Proof. First, suppose R < 1 and p G (i?, 1). There exists iV G N such that 
a„_|_i/a„ < p whenever n > N. This implies a„+i < pa„ whenever n > N. 
From this it's easy to prove by induction that aN+m < P^o-n whenever m e N. 
It follows that, for n> N, 



n 


N 


n 




= y^flfc + 




fe=i 


fe=i 


k=N+l 






n-N 






"Yj ^N+k 




fe=i 


k=l 




AT 


n-N 




fe=l 


Y "■Np'' 
fe=i 




AT 

< ^dfe + 

! 1 


aNP 
l-p 



Therefore, the partial sums of ^ a„ are bounded, and ^ a„ converges. 

If r > 1, then choose A'' e N so that a„+i > a„ for all n > N. It's now 
apparent that a„ t^- 0. □ 

In calculus books, the ratio test usually takes the following simpler form. 
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Corollary 4.2.7 (Ratio Test). Suppose X^a„ is a positive series. Let 

r = lim . 

Ifr<l, then J^O'n converges. If r > 1, then diverges. 

From a practical viewpoint, the ratio test is often easier to apply than the 
root test. But, the root test is actually the stronger of the two in the sense that 
there are series for which the ratio test fails, but the root test succeeds. (See 
Exercise 4.11, for example.) This happens because 

Um inf < Urn inf al/"" < lim sup al/"" < lim sup . (4.8) 

To see this, note the middle inequality is always true. To prove the right- 
hand inequality, choose r > lim sup a„+i/a„. It suffices to show lim sup ay™ < r. 
As in the proof of the ratio test, a„+fe < r'^an- This implies 



iJli. ' <r ' 



which leads to 

(r") 
Finally, 

r i/n 1- i/(n+fe)^,. /a„y/("+fc) 

lim sup a„' = lim sup a^^jj, < lim sup r I — 1 

fc— >oo k—^co ^ 

The left-hand inequality is proved similarly. 



= r. 



4.2.2 Kummer-Type Tests 

Most times the simple tests of the preceding section suffice. However, more dif- 
ficult scries require more delicate tests. There dozens of other, more specialized, 
convergence tests. Several of them are consequences of the following theorem. 

Theorem 4.2.8 (Rummer's Test). Suppose X^o„ is a positive series, Pn is a 
sequence of positive numbers and 

a = lim inf ( p„ — ) < Um sup ( p„ — p„+i J = /3 (4.9) 

\ Cln+l J \ (Xn+l / 

If a > 0, then X^a„ converges. IfJ2^/Pn diverges and (3 < 0, then X^a„ 
diverges. 

Proof. Let s„ = Ylk=i ^k, suppose a > 0 and choose r e (0, a). There must be 
an A'' > 1 such that ^ 

Pn — Pn+i >r, Vn > N. 

dn+l 
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Rearranging this gives 

Pnan - Pn+ldn+l > TUn+l, Vn > N. (4.10) 

For M > N, (4.10) implies 

M M 

{Pntln - Pn+lCln+l) > "^^ri+l 

n=N n=N 

PNttN - PM+iam+i > r{sM - sn-i) 
Pno-n - Pm+io-m+i + rsN-1 > tsm 

PNaN + rsN-i ^ 

> Sm 

r 

Since N is fixed, the left side is an upper bound for sm, and it follows that a„ 
converges. 

Next suppose ^ 1 /p„ diverges and /3 < 0. There must be an A'' e N such 
that ^ 

Pn — Pn+i < 0, Vn > N. 

«n+l 

This imphes 

Pnan < Pn+ian+1, Vn > N. 

Therefore, p„a„ > pjvajv whenever n> N and 

Pn 

Because N is fixed and '^l/pn diverges, the Comparison Test shows XI 
diverges. □ 

Rummer's test is powerful. In fact, it can be shown that, given any positive 

series, a judicious choice of the sequence Pn can always be made to determine 
whether it converges. (See Exercise 4.17, [15] and [14].) But, as stated, Rum- 
mer's test is not very useful because choosing p„ for a given series is often 
difficult. Experience has led to some standard choices that work with large 
classes of series. For example. Exercise 4.9 asks you to prove the choice p„ = 1 
for all n reduces Rummer's test to the standard ratio test. Other useful choices 
are shown in the following theorems. 

Theorem 4.2.9 (Raabe's Test). Let a„ he a positive series such that a„ > 0 
for all n. Define 

a = lim sup n ( — 1 ) > lim inf n ( -—^ 1 ) = ^ 

n^oo \an+l J "-i-oo V^^n+l / 

If a> 1, then converges. If (3 <\, then diverges. 

Proof. Let Pn = n in Rummer's test, Theorem 4.2.8. □ 
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When Raabc's test is inconclusive, there are even more deUcate tests, such 

as the theorem given below. 

Theorem 4.2.10 (Bertrand's Test). Let "^an be a positive series such that 
oin > 0 for all n. Define 



a = Hminf Inn I n I 1—1 < lim sup Inn I n I 1—1 = /3. 

rn-oo y \an+l ) J n^oo \ \an+l J J 

If a> 1, then X^a„ converges. If p <1, then X^a„ diverges. 

Proof. Let p„ = nlnn in Kummer's test. □ 

Example 4.2.7. Consider the series 

It's of interest to know for what values of p it converges. 

An easy computation shows that a„+i/a„ — >■ 1, so the ratio test is inconclu- 
sive. 

Next, try Raabe's test. Manipulating 



p 

lim n I ( — 1 1 = lim 



2n+3 
2n+2 



it becomes a 0/0 form and can be evaluated with L'Hospital's rule.^ 

2 / 3+2 n '^^ 



^^[mn P p 

lim ^— - — = -. 

n^oo (1 + n) (3 + 2n) 2 

From Raabe's test. Theorem 4.2.9, it follows that the series converges when 
p> 2 and diverges when p < 2. Raabe's test is inconclusive when p = 2. 
Now, suppose p = 2. Consider 

lim Inn ( n ( 1 ) — 1 ) = — lim Inn-^ — ^\ = 0 



ftn+i J J 4 (1 + nf 

and Bertrand's test. Theorem 4.2.10, shows divergence. 
The series (4.11) converges only when p> 2. 



4.3 Absolute and Conditional Convergence 

The tests given above are for the restricted case when a series has positive 
terms. If the stipulation that the series be positive is thrown out, things becomes 
considerably more complicated. But, as is often the case in mathematics, some 
problems can be attacked by reducing them to previously solved cases. The 
following definition and theorem show how to do this for some special cases. 

2See §7.5.2. 
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Figure 4.3: This plot shows the first 35 partial sums of the alternating harmonic 
series. It converges to In 2 « 0.6931, which is the level of the dashed line. Notice how 
the odd partial sums decrease to In 2 and the even partial sums increase to In 2. 



Definition 4.3.1. Let ^a„ be a series. If X^knl converges, then J^'^n is 
absolutely convergent. If it is convergent, but not absolutely convergent, then it 
is conditionally convergent. 

Since I'^nl ^ positive series, the preceding tests can be used to deter- 
mine its convergence. The following theorem shows that this is also enough for 
convergence of the original series. 

Theorem 4.3.1. //^a„ is absolutely convergent, then it is convergent. 
Proof. Let e > 0. Theorem 4.1.3 yields an e N such that when n > m > N, 



n 
fe=m 



J2 

k—m 



> 0. 



Another application Theorem 4.1.3 finishes the proof. 



□ 



Example 4.3.1. The series ^ (— l)"+^/n is called the alternating harmonic se- 
ries. Since the harmonic series diverges, we see the alternating harmonic series 
is not absolutely convergent. 

On the other hand, if s„ = X]fc=i(^l)'""'^^/^j then 



S2n = 



E 

k=l 



1 



2k -1 



1 

2k 



E 

k=l 



2fc(2fc- 1) 



is a positive series that converges. Since \s2n — S2n-i| = l/2n 0, it's clear 
that S2ri-i must also converge to the same limit. Therefore, s„ converges and 
J2 (— l)"^^/fi is conditionally convergent. (See Figure 4.3.) 
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To summarize: absolute convergence implies convergence, but convergence 
does not imply absolute convergence. 

There are a few tests that address conditional convergence. Following are 
the most well-known. 

Theorem 4.3.2 (Abel's Test). Let an and bn be sequences satisfying 

(a) Sn = J2k=i is a bounded sequence. 

(b) bn > bn+u Vn e N 

(c) bn^Q 

Then ^ a„6„ converges. 

To prove this theorem, the following lemma is needed. 
Lemma 4.3.3 (Summation by Parts). For every pair of sequences an and bn, 

n n n k 

a-kbk = bn+i ^O'k- ^{bk+i - bk) ^ ae 
fe=i k=i fe=i e=i 

Proof. Let s„ = J2'k=i ^k and sq = 0- Then 



k=l 



^(sfc - Sk 
fe=i 


-i)bk 


n 

y~] Skbk - ] 

fe=l k 


n 

^Sk-ibk 
=1 


n 

^Skbk - 
fe=i 


Skbk+l — Snbn+1 

Vfe=i 


n 

bn+1 ^ O/c 
fe=l 


n k 

-^(6fc+i-6fe)^( 
fe=i e=i 



ae 



□ 

Proof. To prove the theorem, suppose |X]fe=i ak\ < M for all n e N. Let £ > 0 
and choose N gN such that 6jv < e/2M. li N < m < n, use Lemma 4.3.3 to 
write 



m— 1 



n n 

bn+1 "^ae- ^(6^+1 - bi) ^ ak 
e=i e=i fe=i 

/ m-l m-l ^ N 

- [bm^ ae- ^ {be+i - be) ^ ak 



e=i 



fe=i 
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Using (a) gives 

n 

< (6„+i + b^)M + M ^ \be+i - be\ 

Now, use (b) to see 

n 

= {bn+1 + bm)M + M ^ (6^ - bi+i) 

and then telescope the sum to arrive at 

= {bn+l + bm)M + M{bm - ^n+l) 
= 2Mbm 

< e 

This shows X]"=i '^^^^ satisfies Theorem 4.1.3, and therefore converges. □ 

There's one special case of this theorem that's most often seen in calculus 
texts. 

Corollary 4.3.4 (Alternating Series Test). //c„ decreases to 0, then the series 

5^ (— l)""'"^c„ converges. 

Proof. Let a„ = (-l)"+i and b„ = c„ in Theorem 4.3.2. □ 

A series such as that in Corollary 4.3.4 is called an alternating series. More 
formally, if a„ is a sequence such that a„/an-i-i < 0 for all n, then ^a„ is 
an alternating series. Informally, it just means the series alternates between 

positive and negative terms. 

Example 4.3.2. Corollary 4.3.4 provides another way to prove the alternating 
harmonic series in Example 4.3.1 converges. Figure 4.3 shows how the partial 
sums bounce up and down across the sum of the series. 



4.4 Rearrangements of Series 

We want to use our standard intuition about adding lists of numbers when 
working with series. But, this intuition has been formed by working with finite 
sums and does not always work with series. 

Example 4.4.1. Suppose E (-l)"+7^ = 7 so that (-l)"+^2/n = 27. It's 
easy to show 7 > 1/2. Consider the following calculation. 

= 2-1+2-1 + 2-1+... 
3 2 5 3 
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Rearrange and regroup. 



.^ 1 1\ 1 /2 1\ 1 

= (^-l)-2n3-3j-4n5-5)-6+- 

, 1 1 1 
=^-2+3-4+- 

= 7 

So, 7 = 27 with 7^0. Obviously, rearranging and regrouping of this series is 
a questionable thing to do. 

In order to carefully consider the problem of rearranging a series, a precise 
definition is needed. 

Definition 4.4.1. Let a : N — > N be a bijection and be a series. The new 

series Yl oicr(n) is a rearrangement of the original series. 

The problem with Example 4.4.1 is that the series is conditionally convergent. 

Such examples cannot happen with absolutely convergent scries. For the most 
part, absolutely convergent series behave as wc arc intuitively led to expect. 

Theorem 4.4.1. If^an is absolutely convergent and X]Q'<T(n) is a rearrange- 
ment ofY^an, thenJ2aa{n) = E^n- 

Proof. Let J2^n = s and £ > 0. Choose A'' e N so that N < m < n implies 
Ylk=m kfel < ^- Choose M > N such that 

{l,2,...,7V}cMl),a(2),...,a(M)}. 

If P > M, then 



k=N+l 

and both series converge to the same number. □ 



^ aft - ^ a<T(/c) 
fe=i fe=i 



When a series is conditionally convergent, the result of a rearrangement is 
unpredictable. This is shown by the following theorem. 

Theorem 4.4.2 (Riemann Rearrangement). // ^ a„ is conditionally conver- 
gent and c € MU{— 00, 00}, then there is a rearrangement a such that X] ao-(n) = 
c. 

To prove this, the following lemma is needed. 
Lemma 4.4.3. // J2 '^n is conditionally convergent and 

Ja„, a„>0 )-a„, a„ < 0 

On = { and Cn = < , 

[0, a„<0 [0, an>0 

then both ^ 6„ and J2 diverge. 
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Proof. Suppose Yl converges. By assumption, ^ a„ converges, so Theorem 
4.1.1 implies 

converges. Another application of Theorem 4.1.1 shows 

converges. This is a contradiction of the assumption that ^ a„ is conditionally 
convergent, so ^ 6„ cannot converge. 

A similar contradiction arises under the assumption that ^ c„ converges. □ 



Proof. (Theorem 4.4.2) Let 6„ and c„ be as in Lemma 4.4.3 and define the 

siibscqucncc a+ of 6„ by removing those terms for which 6„ = 0 and a„ 7^ 0. 
Define the subsequence a~ of c„ by removing those terms for which c„ = 0. 
The series X^^^ and X^^i are still divergent because only terms equal 
to zero have been removed from &„ and c„ . 

Now, let c G M and mo = no = 0. According to Lemma 4.4.3, we can define 
the natural numbers 



mi = min{n : > c} and ni = min{n : + < c}. 

fc=i fe=i ^=1 

If THp and rip have been chosen for some p e N, then define 




fe=0 \t=mk + l e=nk + l / t=mp + l 



np+i = min <^ n : ^ ^ a| - ^ a7 



k=0 V=mfe + 1 e=nk+l J 

+ E E 



< c 



Consider the scries 

ai + at + ---+a+^-a^ -a^ a" (4.12) 

+ a'l 



+ 



'mi + l 


+ + ■ ' 






'm2 + l 


+ ^^^^12+2 + ■ ' 
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It is clear this scries is a rearrangement of Yl'^=i and the way in which rUp 
and Up were chosen guarantee that 

0 < E E < - E ""^^ E «n - c < "i., 

fe=0 \l=mh+\ t=nk + l fe=mp + l y 

and 

p / mfc+i rife \ 

0 < ^ - E E - E "7 p V 

k=0 \e=mk + l £=nk + l / 

Since both — >■ 0 and — >■ 0, the result follows from the Squeeze Theorem. 
The argument when c is infinite is left as Exercise 4.31. □ 

4.5 Exercises 

4.1. Prove Theorem 4.1.3. 

4.2. If X^^i On is a convergent positive series, then does X^^i converge? 

•DO 

4.3. The series ^^(fln — On+i) converges iff the sequence a„ converges. 

n=l 

4.4. Prove or give a counter example: If ^ |a„| converges, then na„ — >■ 0. 

4.5. If the series ai + 02 + as + • • • converges to S, then so does 

ai +0 + 02 + 0 + 0 + as + 0 + 0 + 0 + 04 H • (4.13) 

4.6. If X^J^i On converges and 6„ is a bounded monotonic sequence, then 
J2n=i ^ribn Converges. 

4.7. Let Xn be a sequence with range {0,1,2,3,4,5,6,7,8,9}. Prove that 
a;„10~" converges. 

4.8. Write 6.17272727272 • • • as a fraction. 

4.9. Prove the ratio test by setting p„ = 1 for all n in Kummer's test. 

4.10. If '^n and X^^TLi arc positive series and lim exists and is 

not zero, then both series converge or both series diverge. (This is called the 
limit comparison test.) 

4.11. Consider the series 

. , 1111 

l + H 1 1 \ 1 =4. 

2 2 4 4 
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Show that the ratio test is inconclusive for this series, but the root test gives a 
positive answer. 

4.12. Does 

1 1x2 1x2x3 1x2x3x4 

+ 1. ; ^ + ^ ^ ^ — ^ + --- 



3 3x5 3x5x7 3x5x7x9 



converge? 



4.13. For what values of p does 



2-4/ V2-4-6 



+ • 



converge? 



4.14. Find sequences a„ and 6„ satisiying: 

(a) a„ > 0,Vn e N and a„ 0; 

(b) Bn = bk is a bounded sequence; and, 

(c) Er=i o-nbn diverges. 

4.15. Let Qn be a sequence such that a2n ^ and a2n — o,2n-i 0- Then 
a„ A. 

4.16. Prove Bertrand's test. Theorem 4.2.10. 

4.17. Let J2^n be a positive series. Prove that X)a„ converges if and only if 
there is a sequence of positive numbers p„ and a > 0 such that 

,. On 

hm pn Pn+1 = a. 

n-i-oo a„+i 

(Hint: If s = a„ and s„ = J2k=i o-k, then let p„ = (s - Sn)/an.) 

4.18. Prove that X^^q ^ converges for all a; e M. 

4.19. Find all values of x for which X)fe^o ^^(^ + ^)'^ converges. 

4.20. For what values of x does the series 



converge? 



— converge absolutely, converge 

n4" 

Tl=l 

conditionally or diverge? 
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n + 6 

4.22. For what values of x does > — converge absolutely, converge 

^ n-^(x — 1)" 

conditionally or diverge? 

4.23. For what positive values of a does Yl^=i converge? 

4.24. Prove that > cos — sin — converges. 

4.25. For a series X^^^ with partial sums s„, define 

1 " 

Prove that if X]fc°=i '^n = then o"„ — > s. Find an example where (j„ converges, 
but Y^^=i does not. (If i7„ converges, the sequence is said to be Cesdro 
summable.) 

4.26. If is cL sequence with, a, subsequence bn^, then bn is a subseries 
of X^^i dn- Prove that if every subseries of X^^i ctn converges, then X^^^ (Xn 
converges absolutely. 

4.27. If is ^ convergent positive series, then so is X^^i a^- Give an 
example to show the converse is not true. 

4.28. Prove or give a counter example: If X^^i dn converges, then X^^^ 
converges. 

4.29. For what positive values of a does X^^i a"n" converge? 

4.30. If a„ > 0 for all n G N and there is a p > 1 such that lim^^oo 
exists and is finite, then J2^^i a„ converges. Is this true for p=ll 

4.31. Finish the proof of Theorem 4.4.2. 

4.32. Leonhard Euler started with the equation 

X X ^ 

+ --, — = 0, 



x—1 1 — X 

transformed it to 



1 a; „ 

+ — = 0' 



1-1/a; 1-x 
and then used geometric series to write it as 

1 1 



1+x + x^ +x'^ -\ =0. (4.22) 

x 



Show how Euler did his calculation and find his mistake. 
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Chapter 5 

The Topology of M 



5.1 Open and Closed Sets 

Definition 5.1.1. A set G C M is open if for every x G G there is an e > 0 
sucli tliat {x — s,x + e) C G. A set C M is closed if F" is open. 

The idea is that about every point of an open set, there is some room inside 
the set on both sides of the point. It is easy to see that any open interval 
(a, 6) is an open set because if o < a; < 6 and e = min{a; — a,b — x}, then 
{x — e,x + e) C (a, b). 

Open half- lines are also open sets. For example, let x G (a, oo) and e = x — a. 
Then {x — e,x + e) C (o, oo). 

A singleton set {a} is closed. To see this, suppose x ^ a and e = \x — a\. 
Then a ^ (a; — £, a; + e), and {a}^ must be open. The definition of a closed set 
implies {a} is closed. 

A common mistake is to assume all sets are either open or closed. Most sets 
are neither open nor closed. For example, \i S = [a, 6) for some numbers a < b, 
then no matter the size of £ > 0, neither {a — s,a + e) nor (b — s,b + e) are 
contained in S ov S'^. 

Theorem 5.1.1. (a) If {G\ : X G A} is a collection of open sets, then 
Uaga is open. 

(b) // {Gfc : 1 < k < n} is a finite collection of open sets, then nfe=i is 
open. 

(c) Both 0 and M are open. 

Proof. (a) If X G UagA '^a, then there is a A^; e A such that x G G\.^ . Since 
G\^ is open, there is an e > 0 such that x G (a;— e, a;-|-e) c G\^ c Uasa ^a- 
This shows (Jaga is open. 

(b) If a: G nfc=i ^fei then a; G Gfc for 1 < fc < n. For each Gfc there is an Ek 
such that (a; — ek,x + Sk) C Gfc. Let e = mm{ek ■ 1 < k < n}. Then 
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{x — e,x + e) C Gk ioT 1 < k < n, so {x — s,x + e) C nfe=i ^k- Therefore 
rife^i Gk is open. 

(c) 0 is open vacuously. M is obviously open. 

□ 

5.1.1 Topological Spaces 

The preceding theorem provides the starting point for a fundamental area of 
mathematics called topology. The properties of the open sets of R motivate the 
following definition. 

Definition 5.1.2. For X a set, not necessarily a subset of M, let T C 

The set 7 is called a topology on X if {0, X} C 7 and 7 is closed under arbitrary 

unions and finite intersections. The pair {X,7) is called a topological space. 

The elements of T are the open sets of the topological space. The closed sets of 
the topological space are those sets whose complements are open. 

If 0 = {G C M : G is open}, then Theorem 5.1.1 shows (K, 0) is a topological 
space called the standard topology on M. While the standard topology is the most 
widely used one, there are many other possible topologies on K. For example, 
3i = {(a, oo) : a G M} U {M, 0} is a topology on M called the right ray topology. 
The collection H = {S C M. : S'^ is finite} U {0} is called the finite complement 
topology. The study of topologies is a huge subject, further discussion of which 
would take us too far afield. There are many fine books on the subject ([13]) to 
which one can refer. 

Applying DeMorgan's laws to the parts of Theorem 5.1.1 immediately yields 
the following, which is sometimes also used as the definition of the standard 
topology. 

Corollary 5.1.2. (a) // {Fx : X G A} is a collection of closed sets, then 
ClxeA '-'^ closed. 

(b) // {Fk : 1 < k < n} is a finite collection of closed sets, then Ufe=i ^fc 
closed. 

(c) Both 0 and R are closed. 

Surprisingly, 0 and M are both open and closed. They are the only subsets 

of the standard topology with this dual personality. Other topologies may have 
many sets that are both open and closed. Such sets in a topological space are 
often called clopen. 

5.1.2 Limit Points and Closure 

Definition 5.1.3. xq is a limit point^ of c K if for every e > 0, 
{xo-e,xo+e)nS\{xo} ^0. 

^This use of the term limit point is not universal. Some authors use the term accumulation 
point. Others use condensation point, although this is more often used for those cases when 
every neighborhood of xo intersects S in an uncountable set. 
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The derived set of S is 

5' = {a; : a; is a limit point of S}. 
A point xq G S \ S' is an isolated point of S. 

Notice that hmit points of S need not be elements of S, but isolated points 
of S must be elements of S. In a sense, limit points and isolated points are at 
opposite extremes. The definitions can be restated as follows: 

xo is a limit point of 5 iff Ve > 0 (5 fl (a;o — s,xq + s)\ {xq} 0) 

xo & S is an isolated point of 5 iff 3e > 0 (5 fl {xq — s,xo + s)\ {xq} = 0) 

Example 5.1.1. If S = (0, 1], then S' = [0, 1] and S has no isolated points. 

Example 5.1.2. If T = {1/n : n e Z \ {0}}, then T' = {0} and all points of T 

are isolated points of T. 

Theorem 5.1.3. xq is a limit point of S iff there is a sequence Xn & S \ {xq} 
such that Xn xq. 

Proof. (=>) For each n S N choose Xn & S (1 {xq — l/n,xo + 1/n) \ {xo}- Then 
\xn — xo\ < 1/ti for all n e N, so a;„ — )• xo- 

{<=) Suppose Xn is a sequence from x„ G S \ {xq} converging to xq- If 
£ > 0, the definition of convergence for a sequence yields an iV S N such 
that whenever n > N, then a;„ € 5 fl {xq — s,xo + s) \ {xq}- This shows 
S n {xo — s,xo + e)\ {xq} ^ 0, and xq must be a limit point of S. □ 

There is some common terminology making much of this easier to state. If 
e K and G is an open set containing xq, then G is called a neighborhood of 
Xq. The observations given above can be restated in terms of neighborhoods. 

Corollary 5.1.4. Let S cR. 

(a) Xq is a limit point of S iff every neighborhood of xq contains an infinite 
number of points from S. 

(b) Xq G S is an isolated point of S iff there is a neighborhood ofxo containing 
only a finite number of points from S. 

Following is a generalization of Theorem 3.3.3. 

Theorem 5.1.5 (Bolzano- Weierstrass Theorem). If S C M. is bounded and 
infinite, then S" 7^ 0. 

Proof. For the purposes of this proof, if / = [a, b] is a closed interval, let = 
[a, (a + b)/2] be the closed left half of / and = [{a + b)/2,b] be the closed 
right half of I. 

Suppose S' is a bounded and infinite set. The assumption that S is bounded 
implies the existence of an interval Ii = [—B, B] containing S. Since S is infinite, 
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at least one of the two sets /-p D *S' or C] S is infinite. Let I2 be either /-p or 
Ii such that /2 n 5 is infinite. 

If In is such that /„n5 is infinite, let 7„+i be either 7^ or I^, where n5 
is infinite. 

In this way, a nested sequence of intervals, /„ for n e N, is defined such that 
7„ n 5 is infinite for all n e N and the length of 7„ is B/2^~^ 0. According 
to the Nested Interval Theorem, there is an € K such that HnGN-^" ~ {2^0}- 

To see that xq is a limit point of S, let £ > 0 and choose n G N so that 
B/2"~^ < e. Then € 7„ c {xq — e,xo + s)- Since 7„ n S is infinite, we see 
S n {xo — s,xo + s)\ {xq} ^ 0. Therefore, xq is a limit point of S. □ 

Theorem 5.1.6. A set S cM. is closed iff it contains all its limit points. 

Proof. {=>) Suppose S is closed and xo is a limit point of S. If xq ^ S, then 
S"^ open implies the existence of e > 0 such that {xq — e,xo + e) Ci S = ^. This 
contradicts the fact that xq is a limit point of S. Therefore, xq G 5*, and S 
contains all its limit points. 

(<;=) Since S contains all its limit points, if xq ^ S, there must exist an £ > 0 
such that (a;o — £, + £) fl S = 0. It follows from this that S" is open. Therefore 
S is closed. □ 

Definition 5.1.4. The closure of a set S is the set S = S U S' . 

For the set S of Example 5.1.1, S = [0, 1]. In Example 5.1.2, T = {l/n:n€ 
Z\{0}} U {0}. According to Theorem 5.1.6, the closure of any set is a closed set. 
A useful way to think about this is that S is the smallest closed set containing 
S. This is made more precise in Exercise 5.2. 

Following is a generalization of Theorem 3.5.1. 

Corollary 5.1.7. 7/{F„ :nGN}isa nested collection of nonempty closed and 
bounded sets, then f]^^^Fn ^ 0. 

Proof. Form a sequence x„ by choosing a;„ G Fn for each n e N. Since the 
Fn are nested, {x^ ■ n € N} C -Fi, and the boundcdness of Fi implies .t„ is a 
bounded sequence. An application of Theorem 3.3.3 yields a subsequence j/„ of 
Xn such that V- It suffices to prove y G Fn for all n e N. 

To do this, fix 77,0 G N. Because j/„ is a subsequence of x„ and Xn^ G Fng, 
it is easy to see i/„ G F„q for all n > uq. Using the fact that Un — >■ y, we see 
y G F^^. Since Fn^ is closed. Theorem 5.1.6 shows y G F„(,. □ 

5.2 Relative Topologies and Connectedness 
5.2.1 Relative Topologies 

Another useful topological notion is that of a relative or subspace topology. 
In our case, this amounts to using the standard topology on R to generate a 
topology on a subset of M. The definition is as follows. 
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Definition 5.2.1. Let X C M. The set 5 C X is relatively open in X, if there 
is a set G, open in M, such that S = G (1 X . The set T C X is relatively closed 
in X, if there is a set F, closed in K, such that S = FdX. (If there is no chance 
for confusion, the simpler terminology open in X and closed in X is sometimes 
used.) 

It is left as exercises to show that if X c M and § consists of all relatively 
open subsets of X, then {X,§) is a topological space and T is relatively closed 
in X, if X \ T e S. (See Exercises 5.10 and 5.11.) 

Example 5.2.1. Let X = [0,1]. The subsets [0,1/2) = X D (-1,1/2) and 
(1/4, 1] = X n (1/4, 2) are both relatively open in X. 

Example J,.2.2. If X = Q, then {a; e Q : - < a; < ^2} = (-^2, v^) n Q = 
[- V2, V2] n Q is clopen relative to Q. 

5.2.2 Connected Sets 

One place where the relative topologies are useful is in relation to the following 
definition. 

Definition 5.2.2. A set S' C M is disconnected if there are two open intervals 
U and V such that U C^V = %,U C^S ^%,V C^S &nd S C UUV. Otherwise, 
it is connected. The sets U r\S and V f^S are said to be a separation of S. 

In other words, S is disconnected if it can be written as the union of two 
disjoint and nonempty sets that are both relatively open in S. Since both these 

sets arc complements of each other relative to S, thciv arc; both clopen in S. This, 
in tm"n, implies 5* is disconnected if it has a proper relatively clopen subset. 

Example 5.2.3. Let S = {x} be a set containing a single point. S is connected 
because there cannot exist nonempty disjoint open sets U and V such that 
Sf\U^$ and Sf\V^$. The same argument shows that 0 is connected. 

Example 5.2.4. If 5 = [-1, 0) U (0, 1], then U = (-2, 0) and V = (0, 2) arc open 
sets such that U C^V = %, U S ^%,V C^ S and S C U VJV . This shows S 
is disconnected. 

Example 5.2.5. The sets U — (— oo, v^) and V = (\/2, oo) arc open sets such 
that J7n V = 0, [/nQ ^ 0, FnQ 7^ 0 andQ C U \JV = M.\{^/2}. This shows 
Q is disconnected. In fact, the only connected subsets of Q are single points. 
Sets with this property are often called totally disconnected. 

The notion of connectedness is not really very interesting on R because the 
connected sets are exactly what one would expect. It becomes more complicated 
in higher dimensional spaces. 

Theorem 5.2.1. A nonempty set S C M. is connected iff it is either a single 
point or an interval. 
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Proof. (^) If S is not a single point or an interval, there must be numbers 
r < s < t such that r,t £ S and s ^ S. In this case, the sets U = (—00, s) and 
V = (s, 00) are a disconnection of S. 

(^) It was shown in Example 5.2.3 that a set containing a single point is 
connected. So, assume S is an interval. 

Suppose S is not connected with U and V forming a disconnection of S. 
Choose u £ UnS and v gVDS. There is no generality lost by assuming u < v, 
so that [u, v] C S. 

Let A = {x : [u,x) C U}. 

We claim A 7^ 0. To sec this, use the fact that U is open to find e G {0,v — u) 
such that {u — e,u + s) CU. Then u < u + e/2 < v, so u + e/2 G A. 
Define w = lub A. 

Since u S it is evident u < w < v and w G S. 

Ifw gU, then u < w < v and there is £ S (0, v — w) such that {w — e, w+e) C 
U and [u,w + e) C S because w + s < v. This clearly contradicts the definition 
of w, so w ^ U. 

liw gV, then there is an £ > 0 such that {w ~e,w\ C V. In particular, this 
shows w = lub A < w — e < w. This contradiction forces the conclusion that 
w(^V. 

Now, putting all this together, we see w & S C U UV and w ^ U UV. This 
is a clear contradiction, so we're forced to conclude there is no separation of 
S. □ 



5.3 Covering Properties and Compactness on R 
5.3.1 Open Covers 

Definition 5.3.1. Let 5 C M. A collection of open sets, 0 = {G\ : A € A}, is 

an open cover of S' if S' C Uceo If 0' C 0 is also an open cover of S, then 
0' is an open subcover of S from 0 . 

Example 5.3.1. Let S = (0,1) and 0 = {(1/n, 1) : n € N}. We claim that 0 
is an open cover of S. To prove this, let x G (0, 1). Choose no € N such that 
1/no < X. Then 

X e (l/no, 1) C y (1/n, 1) = U ^- 

neN Geo 

Since x is an arbitrary element of (0, 1), it follows that (0, 1) = Uceo ^• 

Suppose 0' is any infinite subset of 0 and x € (0,1). Since 0' is infinite, 
there exists an n € N such that a; e (1/n, 1) e 0'. The rest of the proof proceeds 
as above. 

On the other hand, if 0' is a finite subset of 0, then let M = max{n : 
(1/n, 1) e 0'}. If 0 < a; < 1/M, it is clear that x ^ Uceo' ^' ^° 0' 
open cover of (0, 1). 

2 
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Example 5.3.2. Let T = [0, 1) and 0 < e < 1. If 

0 = {(l/n,l) :neN}U {(-£,£)}, 

then 0 is an open cover of T. 

It is evident that any open subcover of T from 0 must contain (— e,e), 
because that is the only element of 0 which contains 0. Choose n G N such that 
1/n < e. Then 0' = {(— e,e), (1/n, 1)} is an open subcover of T from 0 which 
contains only two elements. 

Theorem 5.3.1 (Lindclof Property). IfScM. and 0 is any open cover of S, 
then 0 contains a subcover with a countable number of elements. 

Proof. Let 0 = {Gx : A € A} be an open cover of S C M. Since 0 is an open 
cover of S, for each x G S there is a A^; e A and numbers Px,Qx ^ Q satisfying 
2^ S (Pxj 9x) C e 0. The collection T = {{px, Qx) '■ x G S} is a.n open cover 
of 5. 

Thinking of the collection 7 = {{px,qx) x G S} as a set of ordered pairs of 
rational numbers, it is seen that card (T) < card (Q x Q) = Hq, so T is countable. 
For each interval I g7, choose a A/ G A such that / C Gxj ■ Then 

5 C U 7 C U Ga, 

/GT 7GT 

shows 0' = {Gaj : / G T} C 0 is an open subcover of S from 0. Also, 
card (0') < card {7) < Hq, so 0' is a countable open subcover of S from 0. □ 

Corollary 5.3.2. Any open subset ofM. can be written as a countable union of 
pairwise disjoint open intervals. 

Proof. Let G be open in R. For x € G let = gib {y : (y, x] C G} and ^Bx = 
lub{y : [x,y) C G}. The fact that G is open implies ax < x < j3x- Define 
Ix = {ax.Px)- 

Then Ix C G. To see this, suppose x < w < f3x- Choose y G {w,Px)- The 
definition of (3x guarantees w G {x,y) C G. Similarly, if a^, < w < x, it follows 
that w gG. 

This shows 0 = {Ix : x G G} has the property that G = U^eG 

Suppose x,y G G and IxH ly ^ 0. There is no generality lost in assuming 
X < y. In this case, there must he a, w G (x, y) such that w G Ix Ci ly. We know 
from above that both [x,w] C G and [w,y] C G, so [x,y] C G. It follows that 
ax = ay < X < y < j3x = I3y and Ix = ly 

Prom this we conclude 0 consists of pairwise disjoint open intervals. 

To finish, apply Theorem 5.3.1 to extract a countable subcover from 0. □ 

Corollary 5.3.2 can also be proved by a different strategy. Instead of using 
Theorem 5.3.1 to extract a countable subcover, we could just choose one rational 
number from each interval in the cover. The pairwise disjointness of the intervals 
in the cover guarantee this will give a bijection between 0 and a subset of Q. 
This method has the advantage of showing 0 itself is countable from the start. 
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5.3.2 Compact Sets 

There is a class of sets for which the conclusion of Lindelof 's theorem can be 

strengthened. 

Definition 5.3.2. An open cover 0 of a set 5 is a finite cover, if 0 has only a 
finite number of elements. The definition of a finite subcover is analogous. 

Definition 5.3.3. A set if C M is compact, if every open cover of K contains 
a finite subcover. 

Theorem 5.3.3 (Heine-Borel) . A set K c M. is compact iff it is closed and 
bounded. 

Proof. (^) Suppose K is unbounded. The collection 0 = {(— n, n) : n G N} is 
an open cover of K. If 0' is any finite subset of 0, then UgsO' G is a bounded 
set and cannot cover the unbounded set K. This shows K cannot be compact, 
and every compact set must be bounded. 

Suppose K is not closed. According to Theorem 5.1.6, there is a limit point 
X oi K such that x ^ K. Define 0 = {[x - 1/n, x + Xjnf : n € N}. Then 0 is 
a collection of open sets and K c Uceo G = M \ {x}. Let 0' = Wx— l/ni,x + 
l/mY : 1 < i < N} he a finite subset of 0 and M = max{nj : 1 < i < iV}. 
Since x is a limit point of K, there is a. y G K (1 {x — 1/M,x + Clearly, 
y ^ UceO' G = [x — 1/M,x + 1/MY, so 0' cannot cover K. This shows every 
compact set must be closed. 

{<=) Let K be closed and bounded and let 0 be an open cover of K. Applying 
Theorem 5.3.1, if necessary, we can assume 0 is countable. Thus, 0 = {G„ : 
n e N}. 

For each n G N, define 

n n 

Fn = K\\jGi = Knf]Gl 

1=1 i=l 

Then F„ is a sequence of nested, bounded and closed subsets of K. Since 0 
covers K, it follows that 

f]FnCK\\jGn = ^. 

According to the Corollary 5.1.7, the only way this can happen is if F„ = 0 for 
some n G N. Then K C Ur=i ^^'^ ^' ~ i^i : 1 < i < n} is a finite subcover 
of if from 0. □ 

Compactness shows up in several different, but equivalent ways on K. We've 
already seen several of them, but their equivalence is not obvious. The following 
theorem shows a few of the most common manifestations of compactness. 

Theorem 5.3.4. Let K cM.. The following statements are equivalent to each 
other. 
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(a) K is compact. 

(b) K is closed and bounded. 

(c) Every infinite subset of K has a limit point in K. 

(d) Every sequence {an : n G N} C K has a subsequence converging to an 
element of K. 

(e) // Fn is a nested sequence of nonempty relatively closed subsets of K, then 

Proof, (a) <;=^ (b) is the Hcinc-Borcl Theorem, Theorem 5.3.3. 

That (b)=J>(c) is the Bolzano- Weierstrass Theorem, Theorem 5.1.5. 

(c) =>(d) is contained in the sequence version of the Bolzano- Weierstrass 
theorem, Theorem 3.3.3. 

(d) =>(e) Let F„ be as in (e). For each n € N, choose a„ € F„. By assumption, 
a„ has a convergent subsequence 6„ — )• b. Each F„ contains a tail of the sequence 
6„, so 6 € C Fn for each n. Therefore, b G HngN-^"' (®) follows. 

(e) =>(b). Suppose K is such that (e) is true. 

Let Fn = K f\ ((— oo,— n] U [n,oo)). Then F„ is a sequence of sets which 
are relatively closed in K such that HneN-^" = 0. If if is imboimded, then 
Fn 7^ 0, Vn S N, and a contradiction of (e) is evident. Therefore, K must be 
bounded. 

If K is not closed, then there must be a limit point x of K such that x ^ K. 
Define a sequence of relatively closed and nested subsets of K hy Fn = [x — 
l/n,x + l/n]r[ K iov n e N. Then flneN-^" = 0' because x ^ K. This 
contradiction of (e) shows that K must be closed. □ 

These various ways of looking at compactness have been given different 
names by topologists. Property (c) is called limit point compactness and (d) 
is called sequential compactness. There are topological spaces in which various 
of the equivalences do not hold. 

5.4 Dense Sets and Category 

We've already seen one way in which a subset of M can be considered small — if 

its cardinality is at most Hg. Such sets are small in the set-theoretic sense. In 
this section, one way a set can be considered small in the topological sense is 
presented. 

5.4.1 Dense and Nowhere Dense Sets 

We begin by considering a way that a set can be considered topologically large 
in an interval. If I is any interval, recall from Corollary 2.3.8 that /flQ ^ 0 and 
7 n Q'^ 0. An immediate consequence of this is that every real number is a 
limit point of both Q and Q°. In this sense, the rational and irrational numbers 
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are both uniformly distributed across the number hne. This idea is generahzed 

in the following definition. 

Definition 5.4.1. Let Ac B cR. A is said to be dense in B, if B c A. 

Both the rational and irrational numbers are dense in every interval. Corol- 
lary 5.3.2 then shows the rational and irrational numbers are dense in every 
open set. It's not hard to construct other sets dense in every interval. For 
example, the set of dyadic numbers, D = : p,q Q Z}, is dense in every 

interval — and dense in the rational numbers. 

On the other hand, Z is not dense in any interval because it's closed and 
contains no interval. If ^ C i?, where B is an open set, then A is not dense in 
B, if A contains any interval-sized gaps. 

Theorem 5.4.1. Let A <z B c M.. A is dense in B iff whenever I is an open 
interval such that I flB then I (lAj^fl). 

Proof. (=>) Assume there is an open interval / such that iCiB ^ 0 and IdA = 0. 
li X G IDB, then / is a neighborhood of x that does not intersect A. Definition 
5.1.3 shows X ^ A' C A, a. contradiction of the assumption that B C A. This 
contradiction implies that whenever I H B j^^, then / n A 7^ 0. 

{<=) If X G B (1 A = A, then x G A. Assume x £ B\A. By assumption, for 
each n G N, there is an .t„ G (x — l/n,x + 1/n) (1 A. Since Xn x, this shows 
X G A' CA. It now follows that B cA. □ 

If _B C M and / is an open interval with I O B 7^ 0, then / n S is often called 
a portion of B. The previous theorem says that A is dense in B iff every portion 
of B intersects A. 

If A being dense in B is thought of as A being a large subset of B, then 
perhaps when A is not dense in _B, it can be thought of as a small subset. But, 
thinking of A as being small when it is not dense isn't quite so clear when it is 
noticed that A could still be dense in some portion of B, even if it isn't dense in 
B. To make A be a truly small subset of B in the topological sense, it should 
not be dense in any portion of B. The following definition gives a way to assure 
this is true. 

Definition 5.4.2. Let A c B c M.. A is said to be nowhere dense in B if B\A 
is dense in B. 

The following theorem shows that a nowhere dense set is small in the sense 
mentioned above because it fails to be dense in any part of B. 

Tiieorem 5.4.2. Let A C B C R. A is nowhere dense in B iff for every 
open interval I such that I (1 B ^ 0, there is an open interval J C I such that 
JnB ^9 and JnA = 9. 

Proof. (^) Let / be an open interval such that IDB 7^ 0. By assumption, B\A 
is dense in B, so Theorem 5.4.1 implies ln{B\A) j^lj). If x € I n {B\A), then 
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there is an open interval J such that x € J C I and J fl A = 0. Since A C A, 
this J satisfies the theorem. 

{<=) Let I be an open interval with Ir\B By assumption, there is an 
open interval J C I such that J fl A = 0. It follows that J fl j4 = 0. Theorem 
5.4.1 implies B\A is dense in B. □ 

Example 5.4.1. Let G be an open set that is dense in M. If / is any open interval, 
then Theorem 5.4.1 implies / fl G 7^ 0. Because G is open, if a; S I O G, then 
there is an open interval J such that x G J C G. Now, Theorem 5.4.2 shows G" 
is nowhere dense. 

The nowhere dense sets are topologically small in the following sense. 

Theorem 5.4.3 (Baire). /// is an open interval, then I cannot be written as 

a countable union of nowhere dense sets. 

Proof. Let A„ be a sequence of nowhere dense subsets of I. According to 
Theorem 5.4.2, there is a bounded open interval Ji C / such that JiHAi = 0. By 
shortening Ji a bit at each end, if necessary, it may be assumed that Ji CiAi = 0. 
Assume J„ has been chosen for some n G N. Applying Theorem 5.4.2 again, 
choose an open interval Jn+i as above so J„+i n An+i = 0. Corollary 5.1.7 
shows 

and the theorem follows. □ 

Theorem 5.4.3 is called the Baire category theorem because of the terminol- 
ogy introduced by Rene-Louis Baire in 1899.^ He said a set was of the first 
category, if it could be written as a countable union of nowhere dense sets. An 
easy example of such a set is any countable set, which is a countable union of 
singletons. All other sets are of the second category.'^ Theorem 5.4.3 can be 
stated as "Any open interval is of the second category." Or, more generally, as 
"Any nonempty open set is of the second category." 

A set is called a Gg set, if it is the countable intersection of open sets. It 
is called an Fo- set, if it is the countable union of closed sets. DeMorgan's laws 
show that the complement of an set is a set and vice versa. It's evident 
that any countable subset of M is an F,^ set, so Q is an F^^ set. 

On the other hand, suppose Q is a set. Then there is a sequence of 
open sets G„ such that Q = PlneN^"- Since Q is dense, each G„ must be 
dense and Example 5.4.1 shows G^ is nowhere dense. From DeMorgan's law, 
K = Q U UnGN^ii' showing M is a first category set and violating the Baire 
category theorem. Therefore, Q is not a Gg set. 

■^Rcnc-Louis Baire (1874-1932) was a French mathematician. He proved the Baire category 
theorem in his 1899 doctoral dissertation. 

^Baire did not define any categories other than these two. Some authors call first category 
sets meager sets, so as not to make students fruitlessly wait for definitions of third, fourth and 
fifth category sets. 
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Essentially the same argument shows any countable subset of M is a first 
category set. The following protracted example shows there are uncountable 
sets of the first category. 



5.4.2 The Cantor Middle-Thirds Set 

One particularly interesting example of a nowhere dense set is the Cantor 
Middle-Thirds set, introduced by the German mathematician Georg Cantor 
in 1884.^ It has many strange properties, only a few of which will be explored 
here. 

To start the construction of the Cantor Middle-Thirds set, let Co — [0, 1] 
and Ci^hX (1/3, 2/3) = [0, 1/3] U [2/3, 1]. Remove the open middle thirds of 
the intervals comprising Ci, to get 



1 

«'9 



U 



U 



u 



9'^ 



Continuing in this way, if C„ consists of 2" pairwise disjoint closed intervals each 
of length 3~", construct C„+i by removing the open middle third from each of 
the closed intervals, leaving 2"+^ closed intervals each of length 3"^""'"'^^. This 
gives a nested sequence of closed sets C„ each consisting of 2" closed intervals 
of length 3"". (See Figure 5.1.) The Cantor Middle- Thhds set is 

C= f]Cn. 

new 



Co 



Figure 5.1: Shown here are the first few steps in the construction of the Cantor 
Middle- Thirds set. 

Corollaries 5.1.2 and 5.1.7 show C is closed and nonempty. In fact, the 
latter is apparent because {0, 1/3, 2/3, 1} C C„ for every n. At each step in the 
construction, 2" open middle thirds, each of length 3~*^"+^^ were removed from 
the intervals comprising C„. The total length of the open intervals removed was 

°° 2" l°°/2\" 

n=0 n=a ^ ' 



^Cantor's originaf work [6] is reprinted with an English translation in Edgar's Classics on 
Fractals [8]. Cantor only mentions his eponymous set in passing and it had actually been 
presented earlier by others. 
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Because of this, Example 5.4.1 implies C is nowhere dense in [0, 1]. 

C is an example of a perfect set, i.e., a closed set all of whose points are limit 
points of itself. (See Exercise 5.23.) Any closed set without isolated points is 
perfect. The Cantor Middle-Thirds set is interesting because it is an example of 
a perfect set without any interior points. Many people call any bounded perfect 
set without interior points a Cantor set. Most of the time, when someone refers 
to the Cantor set, they mean C . 

There is another way to view the Cantor set. Notice that at the nth stage 
of the construction, removing the middle thirds of the intervals comprising C„ 
removes those points whose base 3 representation contains the digit 1 in position 
n + 1. Then, 

C=|c=E|^:c„e{0,2}|. (5.1) 

So, C consists of all numbers c e [0, 1] that can be written in base 3 without 
using the digit 1.^ 

If c € C, then (5.1) shows c = Y^^=i some sequence c„ with range 

in {0, 2}. Moreover, every such sequence corresponds to a unique element of C. 
Define (/> : C -i^ [0, 1] by 

^(^)=^(|:|)=|:'2^- (5-2) 

Since c„ is a sequence from {0, 2}, then c„/2 is a sequence from {0, 1} and 0(c) 
can be considered the binary representation of a number in [0, 1] . According to 
(5.1), it follows that </) is a surjection and 

<t>(C) = jf: ^ : c„ G {0,2}| = jf: ^ : 6„ e {0, 1}| = [0, 1]. 

Therefore, card (C) = card ([0, 1]) > Nq. 

The Cantor set is topologically small because it is nowhere dense and large 
from the set-theoretic viewpoint because it is uncountable. 

5.5 Exercises 

5.1. If G is an open set and F is a closed set, then G \ F is open and F \ G is 
closed. 

5.2. Let 5 C K and J = {F : F is closed and S C F}. Prove 5 
This proves that S is the smallest closed set containing S. 

^Notice that 1 = Y.n=i 2/3", 1/3 = E^=2 2/3", etc. 
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5.3. If S and T are subsets of R, then SUT = SUT. 

5.4. If 5 is a finite subset of R, then S is closed. 

5.5. Q is neither open nor closed. 

5.6. A set 5* C M is open iff 95* n 5 = 0. {dS is the set of boundary points of 
S.) 

5.7. Find a sequence of open sets G„ such that HneN neither open nor 
closed. 

5.8. An open set G is called regular if G = {G)°. Find an open set that is not 
regular. 

5.9. Let 3i = {(x,oo) : x e M} and T = 3^ U {M,0}. Prove that (R,T) is a 
topological space. This is called the right ray topology on R. 

5.10. If X C M and S is the collection of all sets relatively open in X, then 
{X, S) is a topological space. 

5.11. If X c R and G is an open set, then X\G is relatively closed in X. 

5.12. For any set 5, let J = {T C S" : card (S \ T) < Kq} U {0}. Then {S, J) 
is a topological space. This is called the finite complement topology. 

5.13. An uncountable subset of R must have a limit point. 

5.14. If 5 c R, then S' is closed. 

5.15. Prove that the set of accumulation points of any sequence is closed. 

5.16. Prove any closed set is the set of accumulation points for some sequence. 

5.17. If a„ is a sequence such that a„ L, then {a„ : n e N} U {L} is 

compact. 

5.18. If is closed and K is compact, then F (1 K is compact. 

5.19. If {Ka : a e ^} is a collection of compact sets, then f]^^j^Ka is 

compact. 

5.20. Prove the union of a finite number of compact sets is compact. Give 
an example to show this need not be true for the union of an infinite number of 
compact sets. 

5.21. (a) Give an example of a set S such that S is disconnected, but 5U {1} 
is connected, (b) Prove that 1 must be a limit point of S. 
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5.22. If K is compact and V is open with K C V, then there is an open set 

U such that K CU CU CV. 

5.23. If C is the Cantor middle-thirds set, then C = C". 

5.24. If a; € K and K is compact, then there is a z € K such that |x — 2| = 

glb{|x — t/| : t/ e K}. Is z unique? 

5.25. If K is compact and 0 is an open cover of K, then there is an e > 0 
such that for all x G K there is a G e 0 with {x — e,x + e) C G. 

5.26. Let / : [a, 6] — )■ M be a function such that for every x S [a, b] there is a 
Sx > 0 such that / is bounded on (x — Sx,x + 6x)- Prove / is bounded. 

5.27. Is the fimction defined by (5.2) a bijcction? 

5.28. If A is nowhere dense in an interval /, then A contains no interval. 

5.29. Use the Baire category theorem to show R is uncountable. 
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Chapter 6 

Limits of Functions 



6.1 Basic Definitions 

Definition 6.1.1. Let £» C M, be a limit point of £> and / : ^ M. Tlie 
limit of f{x) at xq is L, if for each e > 0 there is a J > 0 such that when x ^ D 
with 0 < |a; — Xo\ < S, then \f{x) ~ L\ < e. When this is the case, we write 
lim^^a;„ f{x) ^ L. 

It should be noted that the limit of / at xq is determined by the values of / 
near xq and not at xq. In fact, / need not even be defined at xq. 



L + E - 

L - 
L-e - 




1 1 1 

Xq — 5 Xq Xq + S 



Figure 6.1: This figure shows a way to think about the limit. The graph of / must 
stay inside the box {xq — S, xq + S) x {L — e, L + e), except possibly the point (a:;o, f{xo)). 



A useful way of rewording the definition is to say that lima;-;.^;;, f{x) = L iff 
for every e > 0 there is a ^ > 0 such that x £ {xq — 6,xo + 6) 0 D\ {xq} implies 
f{x) £ {L — e,L + e). This can also be succinctly stated as 

Ve > 03(5 >0{f{{xo-5,xo + S)nD\{xo})c {L~e,L + e)). 

©Leo Larson (LLarsQniaLouisvilIc.edu) 
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Figure 6.2: The function from Example 6.1.3. Note that the graph is a line with one 
"hole" in it. 



Example 6.1.1. If f{x) = c is a constant function and xq G M, then for any 
positive numbers s and 6, 

X G {xo — 5,xo + S) (1 D \ {xo} => \ f{x) — c\ = \c — c\ = 0 < e. 

This shows the limit of every constant function exists at every point, and the 
limit is just the value of the function. 

Example 6.1.2. Let f{x) = x, xo G K, and s = S > 0. Then 

X G {xo - 6,xo + S) r\D\ {xo} |/(a;) - /(xo)| = \x - xo\ < S = e. 

This shows that the identity function has a limit at every point and its limit is 
just the value of the function at that point. 

Example 6.1.3. Let f{x) = . In this case, the implied domain of / is 

D = R\ {2}. We claim that lim^->2 f{x) = 8. 

To see this, let £ > 0 and choose S e (0, s/2). If 0 < |a; - 2| < 5, then 

2x2 - 8 



l/(^)-8| 



|2(x + 2)-8| =2|a;-2| < e. 



Example 6.1.4. Let f{x) = \/x + 1. Then the implied domain of / is Z) 

[— l,oo). Wc claim that lim^;^-! /(x) = 0. 



To see this, let e > 0 and choose b G (0,e^). If 0 < x - (-1) 



1 < (5, 



then 



\f{x) - oi = \/xTT <\rb <\r^ =E. 



Example 6.1.5. If /(.t) = for x ^ 0, then limj.^o f{x) docs not exist. (See 
Figure 6.3.) To see this, suppose lima;_,.o f{x) = L, s = 1 and 6 > 0. If L > 0 
and -S < X < 0, then f{x) = -1 < L - s. If L < 0 and 0 < x < 6, then 
f{x) = 1 > L + e. These inequalities show for any L and every 5 > 0, there is 
an X with 0 < |a;| < 5 such that \ f{x) — L\ > s. 

There is an obvious similarity between the definition of limit of a sequence 
and limit of a function. The following theorem makes this similarity explicit, 
and gives another way to prove facts about limits of functions. 
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Figure 6.3: The function f{x) = \x\/x from Example 6.1.5. 

Theorem 6.1.1. Let f : D ^M. andxo be a limit point ofD. lima;-).^;^ f{x) = L 
iff whenever Xn is a sequence from D\{xo} such that Xn xq, then f{xn) L. 

Proof. (=>) Suppose limx^xo f(^) = ^ and x„ is a sequence from D \ {xq} such 
that Xn — )■ xq- Let e > 0. There exists a S > 0 such that \ f{x) — L\ < e whenever 
X £ {x — 6,x + 6)r\D\ {xq}. Since a;„ — )■ Xq, there is an S N such that n> N 
implies 0 < |a;„ — a;o| < S. In this case, |/(a;„) — L\ < e, showing f{xn) L. 

(<;=) Suppose whenever a;„ is a sequence from D \ {xq} such that a;„ — > xq, 
then f{xn) — > L, but hma;-).^;^ f{x) ^ L. Then there exists an e > 0 such that 
for all ^ > 0 there is an a; e {xq — S, xo + S)r\D\{xo} such that \f{x) — L\ > e. In 
particular, for each n G N, there must exist a;„ G (.tq — 1/n, .xq + H -D\ {a^o} 
such that \f{xn) — L\> e. Since Xn — >■ a;o, this is a contradiction. Therefore, 
linix-s-xo f{x) = L. □ 

Theorem 6.1.1 is often used to show a limit doesn't exist. Suppose we want 
to show lima;_>a;Q f{x) doesn't exist. There are two strategies: find a sequence 

Xn ^ Xq such that f{xn) has no limit; or, find two sequences y„ — > xq and 
-Zn — ^ a;o such that /(j/n) and /(-Zn) converge to different limits. Either way, the 
theorem shows lim^_>^(, fails to exist. 

In Example 6.1.5, we could choose x^ = (— l)"/ri so oscillates between 

— 1 and 1. Or, we could choose t/„ — 1/n — so f{xn) — > 1 and f{zn) — > — 1. 

Example 6.1.6. Let /(a;) = sin(l/a;), = ^ and 6„ = (4„^i)^ ■ Then J, 0, 
&n 4 0, /(a„) = 0 and /(&„) = 1 for all n e N. An application of Theorem 6.1.1 
shows lima;^o f{x) does not exist. (See Figure 6.4.) 

Theorem 6.1.2 (Squeeze Theorem). Suppose f , g and h are all functions 
defined on D dR with f{x) < g{x) < h{x) for all x G D. If xq is a limit point 
of D and limj;^^;;, f{x) = lim^^xo H^) = L, then Ximx^xo 9{x) = L. 

Proof. Let Xn be any sequence from D \ {xq} such that Xn ^ Xq. According to 
Theorem 6.1.1, both f{xn) L and /i(x„) L. Since /(a;„) < g{xn) < h{xn), 
an application of the Sandwich Theorem for sequences shows g{xn) L. Now, 
another use of Theorem 6.1.1 shows limj;_>.j,Q g{x) = L. □ 
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Figure 6.4: This is the function from Example 6.1.6. The graph shown here is on 
the interval [0.03, 1]. There are an infinite number of oscillations from —1 to 1 on any 
open interval containing the origin. 

Example 6.1.7. Let f{x) = a:sin(l/a;). Since —1 < sin(l/x) < 1 when x 0, 
we see that —x < sin(l/a;) < x for x ^ 0. Since Iima;_i.o a; = hm2._>o — a; = 0, 
Theorem 6.1.2 implies lim2;_j.o a; sin(l/x) = 0. (See Figure 6.5.) 

Theorem 6.1.3. Suppose / : Z? — > M and g : D — > M and Xq is a limit point of 
D. If \imx^x„ fix) = L and lim^j^^ro gi^) = M, then 

(a) hm:,^:,„(/ + g){x) = L + M, 

(b) limx^xo{af){x) = aL, Vx G M, 

(c) lim^^xg {fg){x) ^ LM , and 

(d) \ivcLx^x^{l/ f){x) = 1/L, as long as L^O. 

Proof. Suppose a„ is a sequence from D \ {xq} converging to xq. Then Theo- 
rem 6.1.1 implies /(a„) — ?► L and g(a„) M. (a)-(d) follow at once from the 
corresponding properties for sequences. □ 

Example 6.1.8. Let f{x) — 3x + 2. If gi{x) = 3, 32(2:) = x and 33(0;) = 2, then 
f{x) = gi{x)g2{x) + g3{x) . Examples 6.1.1 and 6.1.2 along with parts (a) and (c) 
of Theorem 6.1.3 immediately show that for every a; S M, Imix^xo fi^) — fi^a)- 
In the same manner as Example 6.1.8, it can be shown for every rational 
function f{x), that lima;-^^;^ f{x) = f{xo) whenever f{xo) exists. 

6.2 Unilateral Limits 

Definition 6.2.1. Let f : D ^ R and xq be a limit point of n (— oo,a;o). 
/ has L as its left-hand limit at xq if for all £ > 0 there is a (S > 0 such that 
f{{xQ — 6, xq) n D) C (L ~ e,L + e). In this case, we write lima;^^;^ f{x) = L. 

Let / : — > M and xq be a limit point of n (xq, 00). / has L as its right- 
hand limit at xq if for all e > 0 there is a (5 > 0 such that f{{xo, xq + S) D D) C 
{L — e, L + e). In this case, we write liuixixo fi^) = L. 
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Figure 6.5: This is the function from Example 6.1.7. The bounding lines y — x and 
y = —X are also shown. There are an infinite number of oscillations between —x and 
X on any open interval containing the origin. 

Standard notations for the unilateral limits are 

lim f{x) = lim f{x) and lim /(x) — lim f{x). 

x^xq x^xq— x].xo x^xo-\- 

Example 6.2.1. As in Example 6.1.5, let f{x) = |a;|/a;. Then lim^^^o .f (2;) = 1 
and lmix^Qf{x) = —1. (See Figure 6.3.) 

Theorem 6.2.1. Let f : D ^ R and xq be a limit point of D. 

lim f{x) = L <F=^ lim f{x) = L = lim f{x) 

X-^Xo X^Xo Xlxg 

Proof. This proof is left as an exercise. □ 

Theorem 6.2.2. /// : {a,b) — > M is monotone, then both unilateral limits of f 
exist at every point of (a, b). 

Proof. To be specific, suppose / is increasing and xq G (a, Let e > 0 and 
L = lub{/(a:) : a < x < xq}. According to Corollary 2.3.4, there must exist an 
X € (a, Xq) such that L — e < f{x) < L. Define 6 — xq — x. If y £ {xq — S, xq), 
then L — e < f{x) < f{y) < L. This shows lim^j^j.^ f(x) = L. 
The proof that lim^-^ajo f{^) exists is similar. 

To handle the case when / is decreasing, consider — / instead of /. □ 

6.3 Continuity 

Definition 6.3.1. Let / : Z) — > M and xq E D. f is continuous at xq if for 
every e > 0 there exists a 6 > 0 such that when x E D with |x — xq] < S, then 
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f{xo) + e 









f 







Xq — S Xo Xq + 6 



Figure 6.6: The function / is continuous at xo, if given any e > 0 there is a 5 > 0 
such that the graph of / does not cross the top or bottom of the dashed rectangle 
(xo - 5,xo + d) X {f{xo) - e, f{xo) + e). 



\f{x) — /(a;o)| < £• The set of all points at which / is continuous is denoted 
C{f). 

Several useful ways of rephrasing this are contained in the following theorem. 
They are analogous to the similar statements made about limits. Proofs are left 
to the reader. 

Theorem 6.3.1. Let f : D ^ R and xq G D. The following statements are 
equivalent. 

(a) xo e C{f), 

(b) For all e > 0 there is a S > 0 such that 

X e {xo~ 5,XQ + 6)nD ^ fix) e (/(a;o) - e, f{xo) + e), 

(c) For all s > 0 there is a S > 0 such that 

/((xo -S,xo+S)r]D) c (/(xo) - e, /(xo) + e). 

Example 6.3.1. Define 

fix) 



2x^-8 



X 7^ 2 
X = 2 



It follows from Example 6.1.3 that 2 g C(/). 

There is a subtle difference between the treatment of the domain of the 
function in the definitions of limit and continuity. In the definition of limit, the 
"target point," xo is required to be a limit point of the domain, but not actually 
be an element of the domain. In the definition of continuity, xo must be in 
the domain of the function, but does not have to be a limit point. To see a 
consequence of this difference, consider the following example. 
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Example 6.3.2. If / : Z — )■ R is an arbitrary function, then C(/) = Z. To see 
this, let no G Z, e > 0 and 6 — 1. If a; S Z with \x — no\ < 6, then x = uq. It 
follows that |/(a;) — /(no)| = 0 < £, so / is continuous at riQ. 

This leads to the following theorem. 

Theorem 6.3.2. Let f : D ^ R and xq & D. If xq is a limit point of D, 
then xo G C{f) iff limx^xo /(^) = /(^o)- If xq is an isolated point of D, then 
xo G C(/). 

Proof. Uxq is isolated in D, then there is an 5 > 0 such that {xq — S, xo+5)nD = 
{xq}. For any e > 0, the definition of continuity is satisfied with this 6. 
Next, suppose xq G D'. 

The definition of continuity says that / is continuous at xq iff for all e > 0 
there is a 5 > 0 such that when x G {xq — 6,xo + S) (1 D, then f{x) G {f{xo) — 
s,f{xo) + £)- 

The definition of limit says that Muix^xo f{x) = /(xq) iff for all e > 0 
there is a 5 > 0 such that when x G {xq — S,xo + S) H D \ {xq}, then f{x) G 
{f{xQ)-e,f{xo)+e). 

Comparing these two definitions, it is clear that xq G C(/) implies 

lim f{x) = f{xo). 

On the other hand, suppose limj;^^:;, f{x) = f{xo) and £ > 0. Choose 5 
according to the definition of limit. When x G (xq — S,xo + S) Ci D\ {xq}, then 
f{x) G ifixo) — e, /(.Xo) + e). It follows from this that when x = xq, then 
f{x) — /(xo) f{xo) ^ f{xo) — 0 < £■ Therefore, when x G (xo — 5, Xo + (5) fl D, 
then /(x) G (/(xo) — e, /(xo) + e), and xq G C(/), as desired. □ 

Example 6.3.3. If /(x) = c, for some c G M, then Example 6.1.1 and Theorem 
6.3.2 show that / is continuous at every point. 

Example 6.3.4. If f{x) = x, then Example 6.1.2 and Theorem 6.3.2 show that 
/ is continuous at every point. 

Corollary 6.3.3. Let f : D ^ R and xq e D. xq G C(/) iff whenever 
sequence from D with x„ — )• xq, then f{xn) — ^ f{xo)- 

Proof. Combining Theorem 6.3.2 with Theorem 6.1.1 shows this to be true. □ 
Example 6.3.5. Suppose 

fix)- 




For each x G Q, there is a sequence of irrational numbers converging to x, 
and for each y G Q° there is a sequence of rational numbers converging to y. 
Corollary 6.3.3 shows C{f) = 0. 
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Example 6.3.6 (Salt and Pepper Function). Since Q is a countable set, it can 
be written as a sequence, Q = {<;„ : n e N}. Define 



If x e Q, tlien X = Qn, for some n and f{x) = 1/n > 0. There is a sequence 
Xn from Q'^ such that Xn ^ x and f{xn) = 0 -/^ f{x) = 1/n. Therefore 



On the other hand, let a; € and e > 0. Choose N € N large enough so 
that 1/A'' < e and let S = min{|x — g„| : 1 < n < N}. If |a; — j/| < S, there 
are two cases to consider. If y G Q*^, then \f{y) — f{x)\ = |0 — 0| = 0 < e. If 
y e Q, then the choice of 6 guarantees y = Qn for some n > N. In this case, 
1/(2/) - fix)\ = f{y) = fiqn) = 1/n < 1/N < s. Therefore, x e C{f). 

This shows that C{f) = Q". 

It is a consequence of the Baire category theorem that there is no function 
/ such that C(/) = Q. Proving this would take us too far afield. 

The following theorem is an almost immediate consequence of Theorem 6.1.3. 

Theorem 6.3.4. Let f : D ^ R and g : D ^ R. If xq e C{f) n C{G), then 

(a) xo e C{f + g), 

(b) Xo e C{af), \/a e R, 

(c) Xo e C{fg), and 

(d) Xo e C{f/g) when g{xo) ^ 0. 

Corollary 6.3.5. If f is a rational function, then f is continuous at each point 
of its domain. 

Proof. This is a consequence of Examples 6.3.3 and 6.3.4 used with Theorem 
6.3.4. □ 

Theorem 6.3.6. Suppose / : D/ M and g : Dg such that f{Df) C Dg. 
If there is an xo € C(/) such that f{xo) € C{g), then xo € C{g o /). 

Proof. Let e > 0 and choose > 0 such that 



Since this shows Theorem 6.3.1(c) is satisfied at xq with the function 50/, it 




C(/)nQ = 0. 



g{{f{xo) - 61, f{xo) + Si)nDg)c{go f{xo) -e,go f{xo) + s). 
Choose ^2 > 0 such that 

/((a;o - 62, Xo + S2) n Df) c (/(a;o) - Si, f{xo) + di). 



Then 



9 o f{{xo - S2, xo + 52) n Df) c g{{f{xo) - Si, f{xo) + 5i) n Dg) 

c {gof{xo)-52,gof{xo) + S2)nDf). 



follows that xq e C{g o f). 



□ 
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Example 6.3.7. If f{x) = y/x for a: > 0, then according to Problem 6.8, C{f) = 
[0, oo). Theorem 6.3.6 shows / o /(x) = ^ is continuous on [0, oo). 

In similar way, it can be shown by induction that f{x) = x^l"^ is continuous 
on [0, oo) for all m^n&'L. 



6.4 Unilateral Continuity 

Definition 6.4.1. Let / : D ^ M and G D. f is left- continuous at xq if for 
every £ > 0 there is a 5 > 0 such that f{{xo — S, Xo]r\D) C (/(xq) — £, /(a;o) +e). 

Let f : D ^ M. and xq & D. / is right- continuous at xo if for every £ > 0 
there is a 5 > 0 such that /([a;o, xo + 6) (ID) c {f{xo) — £, f{xo) + £)• 

Example 6.4.1. Let the floor function be 

[xj = max{n gZ : n < x} 
and the ceiling function be 

\x] = min{Ti G Z : n > x}. 

The floor function is right-continuous, but not left-continuous at each integer, 
and the ceiling function is left-continuous, but not right-continuous at each 
integer. 

Theorem 6.4.1. Let f : D and xq G D. xq G C{f) iff f is both right and 

left- continuous at Xq. 

Proof. The proof of this theorem is left as an exercise. □ 

According to Theorem 6.2.1, when / is monotone on an interval (a, 6), the 
unilateral limits of / exist at every point. In order for such a function to be 
continuous at xq G (a, 6), it must be the case that 

lim f{x) = f{xo) = lim f{x). 

If either of the two equalities is violated, the function is not continuous at xq. 

In the case, when lim^j-i-a;;, f{x) ^ lima;4.j:(, f{x), it is said that a jump discon- 
tinuity occurs at xq- 

Example 6.4.2. The function 

/(^•) = 

has a jump discontinuity at x = 0. 

In the case when lima;^^,;, f{x) = fi^) f{xo)j it is said that / has a 

removable discontinuity at xq- The discontinuity is called "removable" because 
in this case, the function can be made continuous at Xq by merely redefining its 
value at the single point, xq, to be the value of the two one-sided limits. 
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Figure 6.7: The function from Example 6.4.3. Note that the graph is a line with one 
"hole" in it. Plugging up the hole removes the discontinuity. 

Example 6.4.3. The function f{x) = ^^^2 ^"-"^ continuous at x = 2 because 2 
is not in the domain of /. Since \im.x-^2 fix) = 4, if the domain of / is extended 
to include 2 by setting /(2) = 4, then this extended / is continuous everywhere. 
(See Figure 6.7.) 

Theorem 6.4.2. /// : {a,b) — > M is monotone, then (a, 6) \ C(/) is countable. 

Proof. In light of the discussion above and Theorem 6.2.1, it is apparent that 
the only types of discontinuities / can have are jump discontinuities. 

To be specific, suppose / is increasing and xq, j/o G (a, b)\C{f) with xq < yo. 
In this case, the fact that / is increasing implies 

lim f{x) < lim f{x) < lim f{x) < lim f{x). 

x^xo xlxo xl^yo xlyo 

This implies that for any two xo,yo € (a, &) \ C{f), there are disjoint open in- 
tervals, Ixo = (lim^t^o /(a:),lima;;a;o f{x)) and ly^ = (hm^j^yj, f{x),Hm.xiy„ f{x)). 
For each x € (a, b) \ C{f), choose qx & Ix ri Q. Because of the pairwise disjoint- 
ness of the intervals {I^ : x & (a, 6) \ C(/)}, this defines an bijection between 
(a, b) \ C{f) and a subset of Q. Therefore, (a, b) \ C{f) must be countable. 
A similar argument holds for a decreasing function. □ 

Theorem 6.4.2 implies that a monotone function is continuous at "nearly ev- 
ery" point in its domain. Characterizing the points of discontinuity as countable 
is the best that can be hoped for, as seen in the following example. 

Example 6.4.4. Let D = {dn : n G N} be a countable set and define Jx = {n : 
dn < x}. The function 

/(-) = E ^- (6-1) 

neJx 

is increasing and C{f) = D^. The proof of this statement is left as Exercise 6.9. 
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6.5 Continuous Functions 

Up until now, continuity has been considered as a property of a function at a 
point. There is much that can be said about functions continuous everywhere. 

Definition 6.5.1. Let f : D ^ M. and A C D. We say / is continuous on A if 
A C C(/). If D = C(/), then / is continuous. 

Continuity at a point is, in a sense, a metric property of a function because 
it measures relative distances between points in the domain and image sets. 
Continuity on a set becomes more of a topological property, as shown by the 
next few theorems. 

Theorem 6.5.1. f : D ^ R is continuous iff whenever G is open in R, then 

f~^{G) is relatively open in D. 

Proof. (=>) Assume / is continuous on D and let G be open in M. Let x e f~^{G) 
and choose e > 0 such that {f{x) — e, f{x) + e) c G. Using the continuity of / 
at X, we can find a, 5 > 0 such that f{{x — 6,x + 5) D D) C G. This implies that 
{x — 5,x + S) r\ D c f~^{G). Because x was an arbitrary element of f~^{G), it 
follows that f~^{G) is open. 

{<=) Choose X E D and let e > 0. By assumption, the set f^^({,f{x) — 
e, f{x) + e) is relatively open in D. This implies the existence of a (5 > 0 such 
that {x - 5,x + 5) r\ D c f~^{{f{x) - e, f{x) + e). It follows from this that 
/((a; -5,x + 5)r\D)c {f{x) - s, f{x) + e), and x e C{f). □ 

A function as simple as any constant function demonstrates that /(G) need 
not be open when G is open. Defining / : [0, oo) — >■ K by f{x) = sinxtan"^ x 
shows that the image of a closed set need not be closed because /([O, oo)) = 
(-^2,^/2). 

Theorem 6.5.2. // / is continuous on a compact set K , then f{K) is compact. 

Proof. Let 0 be an open cover of f{K) and 3 = {f~^{G) : G e 0}. By Theorem 
6.5.1, J is a collection of sets which are relatively open in K. Since 3 covers K, 3 
is an open cover of K. Using the fact that K is compact, we can choose a finite 
subcover of K from J, say {Gi, G2, . . . , G„}. There are {Hi,H2, . . . , Hn} C 0 
such that f~^{Hk) = Gfe for 1 < fc < n. Then 

f{K)cfl \J gA= \J Hk. 

\l<fe<n / l<k<n 

Thus, {iJi, H2,..., H3} is a subcover of f{K) from 0. □ 

Several of the standard calculus theorems giving properties of continuous 
functions are consequences of Corollary 6.5.2. In a calculus course, K is usually 
a compact interval, [a, b] . 
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Corollary 6.5.3. If f : K ^ W is continuous and K is compact, then f is 
bounded. 

Proof. By Theorem 6.5.2, f{K) is compact. Now, use the Bolzano- Weierstrass 



Corollary 6.5.4 (Maximum Value Theorem). If f : K ^ M. is continuous and 
K is compact, then there are m,M G K such that f{m) < f{x) < f{M) for all 
xgK. 

Proof. According to Theorem 6.5.2 and the Bolzano- Weierstrass theorem, f{K) 
is closed and bounded. Because of this, g\bf{K) e f{K) and lub/(-ftr) G f{K)- 
It suffices to choose m e /"^(glb f{K)) and M € /-^(lub f{K)). □ 

Corollary 6.5.5. If f : K ^M. is continuous and invertible and K is compact, 
then f~^ : f{K) K is continuous. 

Proof. Let G be open in K. According to Theorem 6.5.1, it suffices to show 

/(G) is open in f{K). 

To do this, note that K \ G is compact, so by Theorem 6.5.2, f{K \ G) is 
compact, and therefore closed. Because / is injective, /(G) = f{K) \ f{K \ G). 
This shows /(G) is open in f{K). □ 

Theorem 6.5.6. // / is continuous on an interval I, then f{I) is an interval. 

Proof. If /(/) is not connected, there must exist two disjoint open sets, U and 
V, such that /(/) C C/ U ^ and /(/) n C/ ^ 0 7^ /(/) n V. In this case. Theorem 
6.5.1 implies f^^iU) and f^^{V) are both open. They are clearly disjoint and 
/-i(J7)n / ^ 0 ^ f-\V)r\I. But, this implies f-^{U) and f-^{V) disconnect 
/, which is a contradiction. Therefore, /(/) is connected. □ 

Corollary 6.5.7 (Intermediate Value Theorem). /// : [a, 6] — )• K is continuous 
and a is between f{a) and f{h), then there is a cG [a, b] such that f{c) = a. 

Proof. This is an easy consequence of Theorem 6.5.6 and Theorem 5.2.1. □ 

Definition 6.5.2. A function f : D ^M. has the Darboux property if whenever 
a,b e D and 7 is between /(a) and /(6), then there is a c between a and b such 
that /(c) = 7. 

Calculus texts usually call the Darboux property the intermediate value prop- 
erty. Corollary 6.5.7 shows that a function continuous on an interval has the 
Darboux property. The next example shows continuity is not necessary for the 
Darboux property to hold. 

Example 6.5.1. The function 



theorem to conclude / is bounded. 



□ 
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is not continuous, but does have the Darboux property. (See Figure 6.4.) It can 
be seen from Example 6.1.6 that 0 ^ C{f). 

To see / has the Darboux property, choose two numbers a <b. 

If a > 0 or 6 < 0, then / is continuous on [a,b] and Corollary 6.5.7 suffices 
to finish the proof. 

On the other hand, if 0 € [a, b] , then there must exist an n e Z such that 
both (44t)^, (4^3)^ G [a,b]. Since fij^;^) = 1, fijj^^) = ~l and / is 
continuous on the interval between them, we see f{[a,b]) = [—1,1], which is the 
entire range of /. The claim now follows. 

6.6 Uniform Continuity 

Most of the ideas contained in this section will not be needed until we begin 
developing the properties of the integral in Chapter 8. 

Definition 6.6.1. A fimction / : D ^ M is uniformly continuous if for all e > 0 
there is a 5 > 0 such that when x,y E D with |a; — y| < 6, then \ f{x) — f(y)\ < e. 

The idea here is that in the ordinary definition of continuity, the 6 in the 
definition depends on both the s and the x at which continuity is being tested. 
With uniform continuity, S only depends on e; i. e., the same S works uniformly 

across the whole domain. 

Theorem 6.6.1. If f : D is uniformly continuous, then it is continuous. 
Proof. This proof is left as Exercise 6.31. □ 

The converse is not true. 

Example 6.6.1. Let f{x) = 1/x on D = (0,1) and e > 0. It's clear that / 
is continuous on D. Let 5 > 0 and choose m,n G N such that m > 1/S and 
n — m > e. If x = 1/m and y = l/n, then 0 < y < x < 6 and f{y) — f{x) = 
n — m > e. Therefore, / is not uniformly continuous. 

Theorem 6.6.2. If f : D ^ M. is continuous and D is compact, then f is 
uniformly continuous. 

Proof. Suppose / is not uniformly continuous. Then there is an e > 0 such that 
for every n G N there are x„. y„ G D with y„| < l/n and \f{xn) — f{yn)\ > £■ 
An application of the Bolzano-Weicrstrass theorem yields a subsequence a;„j. of 
Xn such that Xn^ — >• xo e -D. 

Since / is continuous at xq, there is a (5 > 0 such that whenever x G (xq — 
6,xo + 6)nD, then |/(a;) - f{xo)\ < e/2. Choose Ufe G N such that l/uk < S/2 
and Xnk e {xo - S/2,xo + S/2). Then both Xn^^Vuk € {xo - 8,xo + 5) and 

e < |/(a;„J - /(2/„J| = |/(a;„J - /(xq) + f{xo) - /(2/„J| 

< \f{xn,)- f{x<,)\ + \f{xo)- f{yn,)\<e/2 + s/2 = s, 

which is a contradiction. 

Therefore, / must be uniformly continuous. □ 
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The following corollary is an immediate consequence of Theorem 6.6.2. 

Corollary 6.6.3. If f : [a,b] ^ M. is continuous, then f is uniformly continu- 
ous. 

Theorem 6.6.4. Let D cM. and / : D — >■ K. If f is uniformly continuous and 
Cauchy sequence from D, then f{xn) is a Cauchy sequence.. 

Proof. The proof is left as Exercise 6.37. □ 

Uniform continuity is necessary in Theorem 6.6.4. To see this, let / : (0, 1) 

K be f{x) = 1/x and .t„ = 1/n. Then .t„ is a Cauchy sequence, but f{xn) = n 
is not. This idea is explored in Exercise 6.32. 

It's instructive to think about the converse to Theorem 6.6.4. Let f{x) = x^, 
defined on all of M. Since / is continuous everywhere, Corollary 6.3.3 shows / 
maps Cauchy sequences to Cauchy sequences. On the other hand, in Exercise 
6.36, it is shown that / is not uniformly continuous. Therefore, the converse to 
Theorem 6.6.4 is false. Those functions mapping Cauchy sequences to Cauchy 
sequences are sometimes said to be Cauchy continuous. The converse to Theo- 
rem 6.6.4 can be tweaked to get a true statement. 

Theorem 6.6.5. Let f : D ^M. where D is bounded. If f is Cauchy continuous, 
then f is uniformly continuous. 

Proof Suppose / is not uniformly continuous. Then there is an £ > 0 and 
sequences Xn and y„ from D such that |x„ — y„| < 1/n and \ f(xn) — f{yn)\ > 
Since D is bounded, the sequence a;„ is bounded and the Bolzano- Weierstrass 
theorem gives a Cauchy subsequence, a;„^ . The new sequence 



Zk 



2^"(.+i)/2 k odd 
ynk/2 k even 



is easily shown to be a Cauchy sequence. But, /(z^) is not a Cauchy sequence, 
since \f{zi.) — /(2fc+i)| > £ for all odd k. This contradicts the fact that / 
is Cauchy continuous. We're forced to conclude the assumption that / is not 
uniformly continuous is false. □ 

6.7 Exercises 



6.1. Prove lim (x^ + 3x) = —2. 

X— > — 2 

6.2. Give examples of functions / and g such that neither function has a limit 
at a, but f + g does. Do the same for fg. 



6.3. Let / : £) M and a e D'. 



lim f{x) = L lim/(a;) = lim/(a;) = L 

X— >a xfa xla 
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6.4. Find two functions defined on K such that 



0 = \im {f{x)+g{x)) lim /(a;) + lim o(a;). 



6.5. If lim f{x) = L > 0, then there is a i5 > 0 such that f{x) > 0 when 



0 < \x - a\ < 5. 

6.6. If Q = {g„ : n e N} is an enumeration of the rational numbers and 



then hma;_j.a f{x) = 0, for all a € Q°. 

6.7. Use the definition of continuity to show f{x) = x^ is continuous every- 
where on M. 

6.8. Prove that /(x) = ^/x is continuous on [0,00). 

6.9. If / is defined as in (6.1), then D = C{fY. 

6.10. If / : M — )■ M is monotone, then there is a countable set D such that 
the values of / can be altered on D in such a way that the altered function is 
left-continuous at every point of M. 

6.11. Docs there exist an increasing function / : M — > M such that C(/) — Q? 

6.12. If / : K ^- M and there is an a > 0 such that - f(y)\ < a\x - y\ 
for all x, y G K, then show that / is continuous. 

6.13. Suppose / and g are each defined on an open interval I, a € I and 
a e C(/) n C{g). If /(a) > g{a), then there is an open interval J such that 
f{x) > g{x) for all x G J. 

6.14. li f,g : {a, 6) — )• K are continuous, then G = {x : f{x) < g{x)} is open. 

6.15. If / : K ^ R and a € C{f) with /(a) > 0, then there is a neighborhood 
G of a such that f{G) C (0, 00). 

6.16. Let / and g be two functions which are continuous on a set £> C K. 
Prove or give a counter example: {x & D : f{x) > g{x)} is relatively open in D. 

6.17. If /, g : M — )• M are functions such that f{x) = g{x) for all a; e Q and 



6.18. Let I = [a,b]. If / : / / is continuous, then there is a c e / such that 



a 




C(/) = C{g) = R, then f = g. 



f{c) = c. 
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6.19. Find an example to show the conclusion of Problem 18 fails if 7 = (a, b). 

6.20. If / and g are both continuous on [a,b], then {x : f{x) < g{x)} is 
compact. 

6.21. If / : [a, 6] — >■ ]R is continuous, not constant, 

m = gib {f{x) : a< X <b} and M = lub {f{x) : a <x <b}, 

then /([a, 6]) = [m,M]. 

6.22. Suppose / : M — >■ M is a function such that every interval has points at 
which / is negative and points at which / is positive. Prove that every interval 
has points where / is not continuous. 

6.23. If f : [a,b] ^ M. has a limit at every point, then / is bounded. Is this 
true for / : (a, b) ffi? 

6.24. Give an example of a bounded function / : M K with a limit at no 
point. 

6.25. If / : M — > M is continuous and periodic, then there are Xm, xm € K such 
that f{xm) < f{x) < f{xM) for all a; G M. (A function / is periodic, if there is 
a p e M such that f{x +p) = f{x) for all x e M.) 

6.26. A set 5 C M is disconnected iff there is a continuous / : 5 — >■ M such 
that f{S) = {0, 1}. 

6.27. If / : R ^- R satisfies f{x + y) = f{x) + f{y) for all x and y and 0 € C(/), 
then / is continuous. 

6.28. Assume that / : R R is such that f{x + y) = f{x)f{y) for all a;, y e R. 
If / has a limit at zero, prove that either limj;^o f{x) = 1 or f{x) = 0 for all 
a;eM\{0}. 

6.29. If .9 : M ^ M satisfies g{x + y)= g{x)g{y) for all x, y e R and 0 e C{g), 
then g is continuous. 

6.30. If F c R is closed, then there is an / : R -i> R such that F = C{fy. 

6.31. If / : [a, 6] ^ R is uniformly continuous, then / is continuous. 

6.32. Prove that an unbounded function on a bounded open interval cannot 
be uniformly continuous. 

6.33. If / : Z) ^ R is uniformly continuous on a bounded set D, then / is 
bounded. 

6.34. Prove Theorem 6.6.1. 
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6.35. Every polynomial of odd degree has a root. 

6.36. Show f{x) = x^, with domain R, is not uniformly continuous. 

6.37. Prove Theorem 6.6.4. 
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Chapter 7 

Differentiation 



7.1 The Derivative at a Point 

Definition 7.1.1. Let / be a function defined on a neighborhood of xq. / is 
differentiable at xq, if the following limit exists: 

fixo) = lim /(■^o + /.)-/(a.o) _ 

Define D(/) = {a; : /'(x) exists}. 

The standard notations for the derivative will be used; e. g., f'{x), 
Df{x), etc. 

An equivalent way of stating this definition is to note that if xq G D{f), then 
/ {xq) = hm . 

x^xo X — Xq 

(See Figure 7.1.) 

This can be interpreted in the standard way as the limiting slope of the 
secant line as the points of intersection approach each other. 

Example 7.1.1. If f{x) = c for all x and some c e K, then 
lim fei^^^ = lim ^ = 0. 

h->-0 h h->-0 h 

So, f'{x) = 0 everywhere. 
Example 7.1.2. If f{x) = x, then 

lim /(^o + ^)-/(^o) = XQ + h-XQ ^^^h^^ 

h^O h h->-0 h h->-0 h 

So, f'{x) = 1 everywhere. 

"^June 2, 2014 ©Lee Larson (LLarson@Louisville.edu) 
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slope = f('^-o+'^l-f'-'^-'o) 



o,/(«o)) 




Xq + h 



Xo 



slope 



Xl 



0. fi^'o)) 




Xo 



Figure 7.1: These graphs illustrate that the two standard ways of writing the difTer- 
ence quotient are equivalent. 



Theorem 7.1.1. For any function f, D(f) C C(/). 
Proof. Suppose xq G D{f). Then 

lim \f(x)-fixo)\= lim 

X-^X„ .T->.To X — Xo 

= f'ixo) 0 = 0. 

This shows lim^^^g f{x) = f{xo), and xq e C(f). 

Of course, the converse of Theorem 7.1.1 is not true. 
Example 7.1.3. The function f{x) = \x\ is continuous on M, but 

^.^/(0 + /.)-/(0)^^_^.^/(0 + /.)-/(0) 



□ 



so /'(O) fails to exist. 

Theorem 7.1.1 and Example 7.1.3 show that differentiability is a strictly 
stronger condition than continuity. For a long time most mathematicians be- 
lieved that every continuous function must certainly be differentiable at some 
point. In the nineteenth century, several researchers, most notably Bolzano 
and Weierstrass, presented examples of functions continuous everywhere and 
differentiable nowhere.^ It has since been proved that, in a technical sense, 
the "typical" continuous function is nowhere differentiable [4]. So, contrary to 
the impression left by many beginning calculus classes, differentiability is the 
exception rather than the rule, even for continuous functions.. 



^Bolzano presented his example in 1834, but it was little noticed. The 1872 example 
of Weierstrass is more well-known [2]. A translation of Weierstrass' original paper [16] is 
presented by Edgar [8]. Weierstrass' example is not very transparent because it depends on 
trigonometric series. Many more elementary constructions have since been made. One such 
will be presented in Example 9.5.1. 
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7.2 Differentiation Rules 

Following are the standard rules for differentiation learned in every beginning 
calculus course. 

Theorem 7.2.1. Suppose f and g are functions such that xq e D{f) n D{g). 

(a) xo&D{f + g) and {f + g)' {xq) = f {xo) + g' (xq) . 

(b) If a eR, then xq G D{af) and {af)'{xo) = af'{xo). 

(c) Xq € D(fg) and (fgy{xo) = f'{xo)g{xo) + f{xo)g'{xQ). 

(d) If g{xo) ^ 0, then xq e D{f/g) and 



/V/ ^ _ f'{xo)g{xo) - f{xo)g'{xo) 
g)^''>- {g{xo)y 



Proof, (a) 

lim + + + 

^ /(a;o + /i) + ff(a;o + /;,)- /(a:o) - g{xo) 
h^a h 
^ li^ ( /(xo + fe) - f{xo) ^ g{xo + h) -g{xo) \ ^ ^, ^ ^, 
h->-o \ n hi 



(b) 



(c) 



{af){x, + h) -{af){x,) ^ ^ /(XQ + fo) - /(xq) ^ 



^.^^ (/fl)(a:o + /t) - {fg){xo) ^ /(a;o + /i.)ff(a;o + /t) - f{xo)g{xo) 

Now, "slip a 0" into the numerator and factor the fraction. 

^ ^.^ fjxo + h)g{xo + h)- f{xo)g{xo + h) + f{xo)g{xo + h) - f{xo)g{xo) 
h^o h 

/i->o \ n h 

Finally, use the definition of the derivative and the continuity of / and g aX xq. 

= f'{xo)g{xo) + f{xo)g'{xQ) 

June 2, 2014 http://math.louisville.edu/'-^lee/ira 



7-4 



CHAPTER 7. DIFFERENTIATION 



(d) It will be proved that if .g(a;o) 7^ 0, then {l/g)'{xo) = -g'{xo)/{g{xo)f. This 
statement, combined with (c), yields (d). 

1 1 



j-jj^ (l/ff)(a;o + h)- (l/ff)(a;o) ^ ^.^ g{xo + /t) g{xo) 

h^O h h^O h 

^ Ijjjj 9{xo) - gjxQ + h) 1 

h^o h g{xo + h)g{xo) 

{g{xor 



Plug this into (c) to see 

7\' 



{xo) ={f-] (xo) 



nxo)^,+fixo)j^^ 

g[xo) {g{xo)r 

f'{xo)g{xo) - f{xo)g'{xo) 

{gixoW 



□ 



Combining Examples 7.1.1 and 7.1.2 with Theorem 7.2.1, the following the- 
orem is easy to prove. 

Corollary 7.2.2. A rational function is differentiable at every point of its do- 
main. 

Theorem 7.2.3 (Chain Rule). // / and g are functions such that xo G D{f) 
and f{xo) e D{g), then xq e D{g o /) and {g o /)'(a;o) = g' o f{xo)f'{xo). 

Proof. Let t/o — f{xo). By assumption, there is an open interval J containing 
f{xo) such that g is defined on J. Since J is open and xq G C{f), there is an 
open interval I containing xq such that /(/) C J. 
Define h: J ^Rhy 

{5(2/) -5(2/0) X , 
g{yo), vi^yo 
y-yo 
0, y = yo 

Since yo G D{g), we see 

hm h{y) = lim ^iy) ' M _ ^^^^ ^ _ ^ ^ ^ ^^^^^^ 

y^yo y->-yo y — yo 

so yo G C{h). Now, xq € C(/) and /(xq) =yo & C{h), so Theorem 6.3.6 implies 
Xo e C(/i o /). In particular 

lim hof{x) =0. (7.1) 
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Prom the definition oi ho f fov x G I with f{x) ^ f{xo), we can solve for 

9 o fix) - g o f{xo) = {h o fix) + g' o fixo)){f{x) - f{xo)). (7.2) 

Notice that (7.2) is also true when f{x) = /(xq). Divide both sides of (7.2) by 
X — xq, and use (7.1) to obtain 

lim 9of{x)-gofixo) ^ Hm (/^ o /(x) + ff- o /(xo)) ^^"^ " 

x^xo X — Xo x^xo X — Xq 

= (0 + g'o/(xo))/'(aro) 
= 9'of{xo)f'{xo). 

□ 

Theorem 7.2.4. Suppose f : [a, b] — >■ [c, d] is continuous and invertible. If 
Xo e D{f) and f'{xo) ^ 0 for some xq G (a, 6), then f{xo) € D{f~^) and 
{f-')'{f{xo)) = l/f'ixo). 

Proof. Let yo — f{xo) and suppose y„ is any sequence in /([a, 6]) \ {yo} con- 
verging to yo and a;„ = f~^{yn)- By Theorem 6.5.5, f~^ is continuous, so 

Xo = f~^{yo) = lim f~^{yn) = lim 

Therefore, 

lim r\yn) - f-Hvo) ^ x^-xo _ 1 



Vn - yo /(a;„) - f{xo) f'{xo) ' 



□ 



Example 7.2.1. It follows easily from Theorem 7.2.1 that f{x) = x^ is differen- 
tiable everywhere with f'{x) = 3a;^. Define g{x) = \fx. Then g{x) = f^^{x). 
Suppose g{yo) = xo for some yo € M. According to Theorem 7.2.4, 

'( \= = ^ = ^ = ^ = ^ 

f'ixo) 3x1 myo))' 3(^)2 3y2/3- 

In the same manner as Example 7.2.1, the following corollary can be proved. 

Corollary 7.2.5. Suppose q £ Q, fix) = x'^ and D is the domain of f. Then 
f'{x) = qx'^~^ on the set 

J D, when q>l 

\d\ {0}, when q<l' 
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7.3 Derivatives and Extreme Points 

As is learned in calculus, the derivative is a powerful tool for determining the 

behavior of functions. The following theorems form the basis for much of differ- 
ential calculus. First, we state a few familiar definitions. 

Definition 7.3.1. Suppose / : D — )• M and xq E D. f is said to have a 
relative maximum at xq if there is a 5 > 0 such that f{x) < f{xo) for all 
X S {xo — d,Xo + S) f] D. f has a relative minimum at xq if — / has a relative 
maximum at xq. If / has either a relative maximum or a relative minimum at 
Xo, then it is said that / has a relative extreme value at xq. 

The absolute maximum of / occurs at xo if f{xo) > f{x) for all x G D. The 
definitions of absolute minimum and absolute extreme are analogous. 

Examples like f{x) = x on (0, 1) show that even the nicest functions need 
not have relative extrema. Corollary 6.5.4 shows that if D is compact, then any 
continuous function defined on D assumes both an absolute maximum and an 
absolute minimum on D. 

Theorem 7.3.1. Suppose f : (a, b) — M. // / has a relative extreme value at 
Xo and xo G D{f), then f'{xo) = 0. 

Proof. Suppose f{xo) is a relative maximum value of /. Then there must be a 
S > 0 such that f{x) < f{xo) whenever x G {xo — S,xo + 5). Since f'{xo) exists, 

xG{xo-5,xo) =^ fM^fM>o /'(xo) = hm M^/^ > 0 

X — Xo xtxo X — Xo 

(7.3) 

and 

xG{xo,xo + 5) =^ M^IM<o ^ />o)=lim - /("°) < q. 

X — Xo xlxo X — Xo 

(7.4) 

Combining (7.3) and (7.4) shows f'{xo) = 0. 

If f{xo) is a relative minimum value of /, apply the previous argument to 
-/• □ 



Theorem 7.3.1 is, of course, the basis for much of a beginning calculus course. 
If / : [a, b] M, then the extreme values of / occur at points of the set 

C = {x G (a, b) : f'{x) = 0}U{x G [a, b] : f'{x) does not exist}. 

The elements of C are often called the critical points or critical numbers of / on 
[a, b] . To find the maximum and minimum values of / on [a, b] , it suffices to find 
its maximum and minimum on the smaller set C, which is finite in elementary 
calculus courses. 
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7.4 Differentiable Functions 

Differentiation becomes most useful when a function has a derivative at each 
point of an interval. 

Definition 7.4.1. The function / is differentiable on an open interval I if 
/ C D{f ). If / is differentiable on its domain, then it is said to be differentiable. 
In this case, the function /' is called the derivative of /. 

The fundamental theorem about differentiable functions is the Mean Value 
Theorem. Following is its simplest form. 

Lemma 7.4.1 (Rollc's Theorem). // / : [a, 6] — )■ M is continuous on [a, 6], 
differentiable on (a, fe) and f{a) = 0 = fib), then there is a c & {a,b) such that 
/'(c) = 0. 

Proof. Since [a, b] is compact, Corollary 6.5.4 implies the existence of Xm, xm G 
[a,h] such that /(a;„) < f{x) < f{xM) for all x G [a,b]. If f{xm) = fixm), 
then / is constant on [a, b] and any c G (a, b) satisfies the lemma. Otherwise, 
either f{xm) < 0 or f{xM) > 0. If f{xm) < 0, then Xm S (a, 6) and Theorem 
7.3.1 implies f'{xm) = 0. If f{xM) > 0, then xm € (a, 6) and Theorem 7.3.1 
implies f'{xM) = 0. □ 

Rollc's Theorem is just a stepping-stone on the path to the Mean Value 
Theorem. Two versions of the Mean Value Theorem follow. The first is a 
version more general than the one given in most calculus courses. The second 
is the usual version.^ 

Theorem 7.4.2 (Cauchy Mean Value Theorem). // / : [a,b] — > M and g : 
[a,b] — >■ M are both continuous on [a,b] and differentiable on (a, 6), then there is 
a c£ (o, b) such that 

9'{c){f{b)-f{a)) = f\c){9{h)-g{a)). 

Proof. Let 

h{x) = {g{b) - g{a)){f{a) - f{x)) + {g{x) - g{a)){f{b) - f{a)). 

Because of the assumptions on / and g,his continuous on [a, b] and differentiable 
on (a, 5) with h{a) = h{b) = 0. Theorem 7.4.1 yields ace (a, 6) such that 
h'{c) = 0. Then 

0 = h'ic) = -{g{b) - g{a))f'{c) + g'{c){f{b) f{a)) 

=^ 9'{c){f{b) - /(a)) = nc){gib) - gia)). 

□ 
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b — a 



Figure 7.2; This is a "picture proof" of Corollary 7.4.3. 



Corollary 7.4.3 (Mean Value Theorem). If f : [a, 6] — K is continuous on [a, b] 
and differentiable on (a, 6), then there is a c G {a,b) such that f{b) — f{a) = 



Many of the standard theorems of beginning calculus are easy consequences 
of the Mean Value Theorem. For example, following are the usual theorems 
about monotonicity. 

Theorem 7.4.4. Suppose f : (a,b) ^ R is a differentiable function, f is 
increasing on (a, 6) iff f (x) > 0 for all x G {a,h). f is decreasing on {a,b) iff 
fix) < 0 for all X G (a, 5). 

Proof. Only the first assertion is proved because the proof of the second is pretty 
much the same with all the inequalities reversed. 

(=>) li x,y € {a,b) with x ^ y, then the assumption that / is increasing 
gives 



Let x,y E {a,b) with x < y. According to Theorem 7.4.3, there is a 
c £ {x, y) such that f{y) - f{x) = f'(c){y - x) > 0. This shows f{x) < f{y), so 



Corollary 7.4.5. Let / : (a, 6) ^ R be a differentiable function, f is constant 



It follows from Theorem 7.1.1 that every differentiable function is continuous. 
But, it's not true that a derivative need be continuous. 

Example 7.4.1. Let 




□ 




/ is increasing on (a, b). 



□ 



iff f'{x) = 0 for all x e (a, 6). 




We claim / is differentiable everywhere, but /' is not continuous. 
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To see this, first note that when a; ^ 0, the standard differentiation formulas 
give that f'{x) = 2a;sin(l/a;) — cos(l/a;). To calculate /'(O), choose any h ^ 0. 
Then 



fih) 




sin(l//i) 


< 




h 




h 







^\h\ 



and it easily follows from the definition of the derivative and the Squeeze The- 
orem (Theorem 6.1.2) that /'(O) = 0. 

Let Xn = l/27rn for n e N. Then x„ — > 0 and 

/'(x„) = 2xn sin(l/a::„) - cos(l/a;„) = -1 

for all n. Therefore, f'{xn) — > — 1 7^ 0 = /'(O), and /' is not continuous at 0. 

But, derivatives do share one useful property with continuous functions; they 
satisfy an intermediate value property. Compare the following theorem with 
Corollary 6.5.7. 

Theorem 7.4.6 (Darboux's Theorem). // / is differentiable on an open set 
containing [a,b] and 7 is between f'(a) and f'(b), then there is a c £ [a, 5] such 
that f'{c) = 7. 

Proof. If /'(a) = f'{b), then c = a satisfies the theorem. So, we may as well 
assume /'(a) ^ f'(b). There is no generality lost in assuming /'(a) < f'{b), for, 
otherwise, we just replace / with g = —f. 




Figure 7.3: This could be the function h of Theorem 7.4.6. 

Let h{x) = f{x) - jx so that D{f) = D{h) and h'{x) = f'{x) - 7. In 
particular, this implies h'{a) < 0 < h'{b). Because of this, there must be an 
e > 0 small enough so that 

h(a + e) — h(a) ^ , , , , , , 

^ ^ — < 0 hia + e) < h{a) 

and 

h{b)~h[b~e) ^ , x , 
^ > 0 =^ h{b - e) < h{b). 

e 

(See Figure 7.3.) In light of these two inequalities and Theorem 6.5.4, there 
must be a c e (a, b) such that h{c) — gib {h{x) : x £ [a, b]}. Now Theorem 7.3.1 
gives 0 = h'{c) = /'(c) — 7, and the theorem follows. □ 

Here's an example showing a possible use of Theorem 7.4.6. 
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Example 7.4.2. Let 

1, x = 0' 

Theorem 7.4.6 implies / is not a derivative. 

A more striking example is the following 
Example 7.4.3. Define 

^ ' \l, x = 0 ^ ' [-1, x = 0 

Since 

'O, Xt^O 



f{x) - g{x) 



2, a; = 0 



does not have the intermediate value property, at least one of / or g is not a 
derivative. (Actually, neither is a derivative because f{x) = —g{—x).) 



7.5 Applications of the Mean Value Theorem 

In the following sections, the standard notion of higher order derivatives is used. 
To make this precise, suppose / is defined on an interval I. The function / itself 
can be written /'■'^-'. If / is differentiable, then /' is written f''^\ Continuing 
inductively, if n e w, /("^ exists on / and xq e £)(/(")), then /^""'"^^(a;o) = 
d/(")(a;o)/dx. 



7.5.1 Taylor's Theorem 

The motivation behind Taylor's theorem is the attempt to approximate a func- 
tion / near a number a by a polynomial. The polynomial of degree 0 which 
does the best job is clearly po{x) = /(a). The best polynomial of degree 1 is the 
tangent lino to the graph of the function pi{x) — f{a) + f'(a){x — a). Continuing 
in this way, we approximate / near a by the polynomial Pn of degree n such 
that /('=)(a) = pit'\a) for k — 0,1, ... ,n. A simple induction argument shows 
that 

Pn{x) = J2^-^(^ (7-5) 

k=0 

This is the well-known Taylor polynomial of / at o. 

Many students leave calculus with the mistaken impression that (7.5) is the 
important part of Taylor's theorem. But, the important part of Taylor's theorem 
is the fact that in many cases it is possible to determine how large n must be 
to achieve a desired accuracy in the approximation of /; i.e., the error term is 
the important part. 

5 
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Theorem 7.5.1 (Taylor's Theorem). If f is a function such that /, /', . . . , /'"^ 
are continuous on [a, b] and exists on (a, b), then there is a c G (a, b) such 

that 

k=0 ^ ' 

Proof. Let the constant a be defined by 

kr^^-""^ ' (n+l)! 



m = f: ^ ib-ar + (6 - a)"+^ (7.6) 



fe=0 

and define 



Fix) = m - (t - + (b ^ .) 



n+l 



From (7.6) we see that F{a) = 0. Direct substitution in the definition of F 
shows that F{b) = 0. From the assumptions in the statement of the theorem, 
it is easy to see that F is continuous on [a,b] and diffcrcntiable on (a, 6). An 
apphcation of Rolle's Theorem yields a c e (a, 6) such that 

0 = F'ic) = -(^^^^^^{b-cr--(b-cr) =^ a = f^^+'\c), 

as desired. □ 

Now, suppose / is defined on an open interval / with a,x £ I. If / is n + 1 
times differentiable on I, then Theorem 7.5.1 implies there is a c between a and 
X such that 

fix) =Pn{x) + Rf{n,x,a), 

where Rf{n, x, a) = ^\n+i)\' ^ a)"''^^ is the error in the approximation.^ 

Example 7.5.1. Let f{x) = cosx. Suppose we want to approximate /(2) to 5 
decimal places of accuracy. Since it's an easy point to work with, we'll choose 
a = 0. Then, for some c e (0, 2), 

\f(n+^)(r\\ 2"+i 
l%(n>2,0)| = L_(^2-<_. (7.7) 

A bit of experimentation with a calculator shows that n = 12 is the smallest n 
such that the right-hand side of (7.7) is less than 5 x 10^*^. After doing some 
arithmetic, it follows that 

22 2^ 2<5 2^ 2io 2^2 27809 ..r. . 

is a 5 decimal place approximation to cos(2). 



^There are several diflferent formulas for the error. The one given here is sometimes called 
the Lagrange form of the remainder. In Example 8.7.1 a form of the remainder using integra- 
tion instead of differentiation is derived. 
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4 - 



2 - 



2 - 



4 - 




Figure 7.4: Here are several of the Taylor polynomials for the function cos(a;) graphed 
along with cos(x). 

But, things don't always work out the way we might like. Consider the 
following example. 

Example 7.5.2. Suppose 



In Example 7.5.4 below it is shown that / is differentiable to all orders every- 
where and /^"•'(O) — 0 for all n > 0. With this function the Taylor polynomial 
centered at 0 gives a useless approximation. 

7.5.2 L'Hospital's Rules and Indeterminate Forms 

According to Theorem 6.1.3, 

1- f{x) _ limx^a fix) 



whenever \im,j:^a f{x) and linix^a g{x) both exist and Imix^a g{x) =/= 0. But, it 
is easy to find examples where both lima;_j.£i f{x) — 0 and liuix^a g{x) ~ 0 and 
lirRx^a f{x)/ g{x) exists, as well as similar examples where Imix^a fix)/g{x) 
fails to exist. Because of this, such a limit problem is said to be in the inde- 
terminate form 0/0. The following theorem allows us to determine many such 
limits. 

Theorem 7.5.2 (Easy L'Hospital's Rule). Suppose f and g are each continuous 
on [a,b], differentiable on (a, 5) and f[h) = g{b) = 0. If g'{x) ^ Q on {a,b) and 
lim^;^;, f'{x)/g'{x) — L, wtiere L could be infinite, then lim^,^;, f{x)/g{x) = L. 
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Proof. Let x € [a,b), so / and g arc continuous on [x,b] and difFercntiablc 
on {x,b). Cauchy's Mean Value Theorem, Theorem 7.4.2, impUes there is a 
c{x) e {x, b) such that 

/'(c(xM.).»'(c(x))/M^M = ^||a. 

Since x < c{x) < b, it follows that limj,-!-;, c{x) = b. This shows that 
L = limm=lim/:mi=lim.^^^) 



rt6 g'{x) xtb g'{c{x)) xtb g{x) ' 



□ 



Several things should be noted about this proof. First, there is nothing spe- 
cial about the left-hand limit used in the statement of the theorem. It could just 
as easily be written in terms of the right-hand limit. Second, if limj;^^ fi^) / g{x) 
is not of the indeterminate form 0/0, then applying L'Hospital's rule will usually 
give a wrong answer. To see this, consider 

X 1 

lim =0^1= lim -. 

x^o x + 1 x^o 1 

Another case where the indeterminate form 0/0 occurs is in the limit at 
infinity. That L'Hopital's rule works in this case can easily be deduced from 
Theorem 7.5.2. 

Corollary 7.5.3. Suppose f and g are differentiable on {a, oo) and 

lim f{x) = lim g{x) = 0. 

V g'i^) 0 on {a,oo) and linix^oo f (x) / g' {x) = L, where L could be infinite, 
then lima,^.oo f{x)/g{x) = L. 

Proof. There is no generality lost by assuming a > 0. Let 

j/d/x), ..(0,1/a] Gix)=h'/'^' x.(0,l/a]_ 

^ [0, x = 0 \0, x = 0 



Then 



lim_F(a;) — lim f{x) = 0 = lim g(x) = limG(.T), 

xlO a;— >oo rr— >-oo ' x-lQ 



so both F and G are continuous at 0. It follows that both F and G are con- 
tinuous on [0, 1/a] and differentiable on (0, 1/a) with G'{x) = —g'{x)/x^ ^ 0 
on (0, 1/a) and Hma;4,o F'{x)/G'{x) = limj^^oo f'{x)/g'{x) = L. The rest follows 
from Theorem 7.5.2. □ 

The other standard indeterminate form arises when 

lim f{x) = oo = lim g{x). 

x—>oo X— >oo 

This is called an oo/oo indeterminate form. It is often handled by the following 
theorem. 
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Theorem 7.5.4 (Hard L'Hospital's Rule). Suppose that f and g are differen- 
tiable on (a, oo) and g'{x) ^ 0 on (a, oo). // 



lim f{x) = lim g{x) = oo and 



x-s-cx) g'[x) 



. U {— oo, oo}, 



then 



fix) 



lim 



L. 



Proof. First, suppose L gM. and let e > 0. Choose ai > a large enough so that 



9'{x) 



- L 



< £, Va; > ai. 



Since limj;_>.oo /(a;) = oo = Imix^cc g{x), we can assume there is an 02 > ai 
such that both f{x) > 0 and g{x) > 0 when a; > 02- Finally, choose 03 > 02 
such that whenever x > 03, then f{x) > f{a2) and g{x) > 3(02)- 

Let X > 03 and apply Cauchy's Mean Value Theorem, Theorem 7.4.2, to / 
and g on [02, x] to find a c(x) € (02, x) such that 



/'(c(x)) _ /(x)-/(«2) 

5'(c(x)) 



1 



/fed 



(7.6 



If 



/i(x) = 



1 - 



g(a2) 



1 _ /(£2) ' 



then (7.8) implies 



m /'(c(x)) 



ft(x). 



g{x) g'{c{x))' 

Since lima;_).oo ^(a;) = 1, there is an ai > as such that whenever x > 04, then 
\h{x) — 1| < e. If X > a4, then 



fix) 



9ix) 



-L 



f'icix) 



g'ic{x) 

f'icix) 



< 



5'(c(x) 
ficix) 



h{x) - L 

h{x) - Lh{x) + Lh{x) 



L 



L 



|/i(x)| + |L||/i(x)-l| 



g'{c{x)) 

< e{l + e) + \L\e = (1 + \L\ + s)e 
can be made arbitrarily small through a proper choice of e. Therefore 

lim f{x)/g{x) = L. 

x—^oo 

The case when L = oo is done similarly by first choosing a, B > 0 and 
adjusting (7.8) so that f'{x)/g'{x) > B when x > a\, A similar adjustment is 
necessary when L = — oo. □ 
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There is a companion corollary to Theorem 7.5.4 which is proved in the same 
way as Corollary 7.5.3. 

Corollary 7.5.5. Suppose that f and g are continuous on [a, b] and differen- 
tiable on (a, b) with g'{x) on (a, 6). // 

fix) 

lim/(a;) = lim5(a;) = 00 and lim = £ g IR U {— oo, oo}, 

x\.a x\.a x^a g i^X) 



then 

ria g{x) 



Exam,ple 7.5.3. If a > 0, then \Ymx^x h^x/x°' is of the indeterminate form 
oo/oo. Taking derivatives of the numerator and denominator yields 

1/^ 1 ^ 
lim — ^— — j- = lim = 0. 

>oo ax" rr— >-oo ax" 

Theorem 7.5.4 now implies W.m.x^co^'n.x/x" = 0, and therefore In a; increases 
more slowly than any positive power of x. 

Example 7.5.4. Let / be as in Example 7.5.2. It is clear f^^\x) exists whenever 
n G u) and x ^ 0. We claim /^"^(O) = 0. To see this, we first prove that 

-l/x^ 

lim =0, Vn e Z. (7.9) 

x^-o a;" 

When n < 0, (7.9) is obvious. So, suppose (7.9) is true whenever m < n for 
some new. Making the substitution u = 1/x, we see 

lim = lim — (7.10) 

xio «->oo e" 

Since 

(n + l)u" (n+l)u"-i n+l. e"!/^' „ 

lim 5 — = lim 5 = lim t- = 0 

u^oo 2ue« u->-oo 26" 2 x4.o 

by the inductive hypothesis. Theorem 7.5.4 gives (7.10) in the case of the right- 
hand limit. The left-hand limit is handled similarly. Finally, (7.9) follows by 
induction. 

When a; 7^ 0, a bit of experimentation can convince the reader that /^"■'(x) 
is of the form pn{l/x)e~^^^ , where p„ is a polynomial. Induction and repeated 
applications of (7.9) establish that /^"^(O) = 0 for n e w. 



7.6 Exercises 
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7.1. If 

m = 



X^, X G 



0, otherwise 

then show D{f) = {0} and find /'(O). 

7.2. Let / be a function defined on some neighborhood of a; = a with /(a) = 0. 
Prove /'(a) = 0 if and only if o e D{\f\). 

7.3. If / is defined on an open set containing xq, the symmetric derivative of 
/ at xq is defined as 

r(xo) = hm /(^o + fe)- /(^o-^) . 
h^o 2h 

Prove that if f'{x) exists, then so does f^{x). Is the converse true? 

7.4. Let G be an open set and / € D{G). If there is an a e G such that 
lini^^.„ f{x) exists, then lim^^a f'{x) = f'{a). 

7.5. Prove or give a counter example: If / € D{{a, b)) such that /' is bounded, 
then there is an F e C{[a, b]) such that / = F on (a, b). 

7.6. Suppose / is continuous on [a,b] and /" exists on {a,b). If there is an 

xo € (a, 6) such that the line segment between (a, /(a)) and {b, f{b)) contains 
the point (xo, /(xq)), then there is a c S (a, b) such that /"(c) = 0. 

7.7. 

Problem 1. If A = {/:/ = F' for some F : R — > K}, then A is closed under 
addition and scalar multiplication. (This shows the differentiable functions form 
a vector space.) 



7.8. If 



and 



/i(^) 



f2{x) 



jl/2, x = 0 

lsin(l/x), x^O 

'l/2, x = 0 

sin(— 1/x), X 7^ 0 ' 



then at least one of /i and /2 is not in A. 

7.9. Prove or give a counter example: If / is continuous on M and differentiable 
on M \ {0} with lima;^o f'{x) = L, then / is differentiable on R. 

7.10. Suppose / is differentiable everywhere and f(x + y) = f{x)f{y) for all 
x,y gM.. Show that /'(x) = /'(0)/(x) and determine the value of /'(O). 
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7.11. If J is an open interval, / is difFerentiable on / and a G I, then there is 
a sequence a„ S / \ {a} such that an ^ a and f'{an) — >■ f'{cL)- 

7.12. Use the definition of the derivative to find -^\/x. 

ax 

7.13. Let / be continuous on [0,oo) and differentiable on (0, oo). If /(O) = 0 
and < |/(x)| for all a; > 0, then f{x) = 0 for all a; > 0. 

7.14. Suppose / : M — >• K is such that /' is continuous on [a,b]. If there is a 
c e (a, 6) such that /'(c) = 0 and /"(c) > 0, then / has a local minimum at c. 

7.15. Prove or give a counter example: If / is continuous on R and differen- 
tiable on K \ {0} with limj:_).o f'{x) = L, then / is differentiable on R. 

7.16. Let / be continuous on [a,b] and differentiable on {a,b). If /(a) = a 
and I/' (a;) I < /3 for all x G {a, b), then calculate a bound for f{b). 

7.17. Suppose that / : (a, 6) ^ M is differentiable and /' is bounded. If a;„ is 
a sequence from (a, b) such that Xn a, then /(a;„) converges. 

7.18. Let G be an open set and / e D{G). If there is an a € G such that 
lim^^.a/'(x) exists, then lim^^af'ix) = f'{a). 

7.19. Prove or give a counter example: If / G D{{a, h)) such that /' is bounded, 
then there is an € C{[a, b]) such that f = F on (a, b). 

7.20. Show that f{x) = a;^ -|- 2a; -|- 2 is invertible on K and, ii g = f~^, then 
find g'{l). 

7.21. Suppose that / is an open interval and that f"{x) > 0 for all x G /. If 
a G I, then show that the part of the graph of / on / is never below the tangent 
line to the graph at (o, f{a)). 

7.22. Suppose / is continuous on [a,b] and /" exists on (o, 6). If there is an 

Xo € {a,b) such that the line segment between (a, /(a)) and {b,f{b)) contains 
the point {xq, f{xo)), then there is a c e (a, b) such that /"(c) = 0. 

7.23. Let / be defined on a neighborhood of x. 
(a) If f"{x) exists, then 



(b) Find a function / where this limit exists, but f"{x) does not exist. 



lim 



fix -h)- 2f{x) + fix + h) 



= /"(^)- 
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7.24. If / : M M is diffcrcntiablc everywhere and is even, then /' is odd. If 
/ : M — )■ M is differcntiable everywhere and is odd, then /' is even/ 

7.25. Prove that 



sma; 



6 120 



< 



5040 



when \x\ < 1. 



'^A function g is even if g{—x) = g{x} for every x and it is odd if g(—x) = —g{x) for every 
X. The terms are even and odd because this is how g{x) = x" behaves when n is an even or 
odd integer, respectively. 
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Chapter 8 

Integration 



Contrary to the impression given by most calculus courses, there are many ways 
to define integration. The one given here is called the Riemann integral or the 
Ricrnami-Darboux integral, and it the one most commonly presented to calculus 
students. 

8.1 Partitions 

A partition of the interval [a, b] is a finite set P C [a, b] such that {a, 6} C P. The 
set of all partitions of [a, b] is denoted part ([a, b]). Basically, a partition should 
be thought of as a way to divide an interval into a finite number of subintervals 
by choosing some points where it is divided. 

If P e part ([a, 6]), then the elements of P can be ordered in a list as a = 
xo < x\ < ■ ■ ■ < Xn = b. The adjacent points of this partition determine n 
compact intervals of the form ij^ = [xk_i,Xk], 1 < k < n. If the partition is 
clear from the context, we write 1^ instead of . It's clear that these intervals 
only intersect at their common endpoints and there is no requirement they have 
the same length. 

Since it's inconvenient to always list each part of a partition, we'll use the 
partition of the previous paragraph as the generic partition. Unless it's neces- 
sary within the context to specify some other form for a partition, assume any 
partition is the generic partition. 

If / is any interval, its length is written \I\ . Using the notation of the previous 
paragraph, it follows that 

n n 

X] 1-^*^1 ^ ~ ^'^-i^ = Xn-xo = b-a. 

fe=i fe=i 

The norm of a partition P is 

||P|| =max{|jf I : 1 < fc < n}. 
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In other words, the norm of P is just the length of the longest subinterval 
determined by P. If \Ik\ = \\P\\ for every 7^, then P is called a regtt/ar partition. 

Suppose P,Q £ part([a, &]). If P C Q, then Q is called a refinement of 
P. When this happens, we write P <^ Q. In this case, it's easy to see that 
P <Si Q implies ||P|| > \\Q\\. It also follows at once from the definitions that 
P U Q G part([a, b]) with P < P U Q and Q < P U Q. The partition P U Q is 
called the common refinement of P and Q. 



8.2 Riemann Sums 

Let / : [a, 6] — >■ R and P e part([a, 6]). Choose x*f. G Ik for each k. The set 
{x\ : 1 < fc < n} is called a selection from P. The expression 



3^(/,P,4) = ^/(4)|4| 
fe=l 

is the Riemann sum for / with respect to the partition P and selection x*).. No- 
tice that given a particular function / and partition P, there are an uncountably 
infinite number of different possible Riemann sums, depending on the selection 
xjj. This sometimes makes working with Riemann sums quite complicated. 

Example 8.2.1. Suppose / : \a,b] R is the constant function f(x) = c. If 
P G part ([a, b]) and {x^ : 1 < A; < n} is any selection from P, then 

n n 

fe=i fe=i 



Example 8.2.2. Suppose f{x) = x on [a,b]. Choose any P G part([a,6]) where 
|jP|| < 2(6 — a)/n. (Convince yourself this is always possible.-^) Make two 
specific selections l^, = Xk-i and r^ = Xk- If x*^ is any other selection from P, 
then l%<x1.< and the fact that / is increasing on [a, b] gives 

3i(/,P,/^)<3?(/,P,4)<3?(/,P,r^). 



This is with the generic partition 
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With this in mind, consider the following calculation. 



n 



(8.1) 



k=l 



n 



= ^{xk - Xfe_i)|4| 



fe=l 



n 



k=l 



n 



<Eii^f 



k=l 



=n\\pr 

4(6 - a)2 



n 



This shows that if a partition is chosen with a small enough norm, all the 
Riemann sums for / over that partition will be close to each other. 

In the special case when P is a regular partition, = {b — a)/n, rk = 
a + k{b — a)/n and 



In the limit as n ^ oo, this becomes the familiar formula (6^ — a^)/2, for the 
integral of f{x) = x over [a, b]. 

Definition 8.2.1. The function / is Riemann integrable on [a, b], if there exists 
a number 3?(/) such that for all e > 0 there is a (5 > 0 so that whenever 
P e part([a,6]) with ||P|| < S, then 



for any selection a;^ from P. 

Theorem 8.2.1. /// : [a, 6] — ^ M and Ji{f) exists, then IR(/) is unique. 



n 



^{f,Py^) = Y,rk\Ik\ 



fc=l 




(6 -a) (a(n-l)+6(n + l)) 
2n 



W)-3i(/,P,4)| <e 
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Proof. Suppose and i?2(/) both satisfy the definition and e > 0. For 

i = 1,2 choose 5i > 0 so that whenever ||P|| < Si, then 

\Ri{f)-R{f,P,xl)\<s/2, 

as in the definition above. If P e part([a, b]) so that ||P|| < 5i A S2, then 

\Riif) - R2if)\ < \Riif) - R{f, P, 4)1 + 1^2 (/) - R{f, P, xl)\<e 

and it follows = i?2(/)- □ 

Theorem 8.2.2. // / : [a, 6] — >■ M and 3?(/) exists, then f is bounded. 

Proof. Left as an exercise. □ 

8.3 Darboux Integration 

A difficulty with handling Ricmann sums is that there are an uncountably in- 
finite number of Riemann sums associated with each partition. One way to 
resolve this problem was shown in Example 8.2.2, where it was shown there 
were largest and smallest Riemann sums associated with each partition. How- 
ever, that's not always the case, so to use that idea, a little more care must be 
taken. 

Definition 8.3.1. Let / : [a, 6] — )• R be bounded and P G part([a, 6]). For each 
Ik determined by P, let 

Mk = lub {f{x) -.x € Ik} and mfe = gib {f{x) : x € 4}. 

The upper and lower Darboux sums for / on [a, b] are 

n n 

D(/,P) = ^Mfe|/fc| and D(/,P) = ^mfe|7fe|. 
fe=i fe=i 

Theorem 8.3.1. If f : [a,b] ^ M. is bounded and P,Q G part([a, 6]) with 
P <^Q, then 

Mf, P) < Mf, Q) < nf, Q) < nf, P). 

Proof. Let P be the generic partition and let Q = PU{x}, where x € {xko-i,Xxo) 
for some ko. Clearly, P <^ Q. Let 

Ml = lub {/(a;) : x e [xka-i,x]} 
m = gib {/(a;) : x G [xko-i,x]} 
Mr = lub{/(x) : X e [x,Xko]} 
rur = glb{/(x) : x G [x,Xko]} 
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Then 

rriko < m < Ml < and < rur < < Mk^ 

so that 

TOfcol4ol = {\{3:ko-i,x]\ + |[x,xfej|) 

< mi\[xko-i,x]\ + mr|[x,xfcj| 

< Mi\[xko-i,x]\ + Mr\[x,Xko]\ 

< Mko\[xko-i,x]\ + Mko\[x,Xko]\ 
= MkM- 

This implies 

n 

fco — 1 n 

= mk\Ik\ + mko\Iko\ + rnk\Ik\ 

k=l k=ko+l 

ktj — l n 

< 'Y mk\Ik\+mi\[xko-i,x]\+mr\[x,Xko]\+ ^ rnk\Ik\ 

fe=l fc=feo+l 

= Mf,Q) 

<n.f,Q) 

kQ — 1 n 

= ^ Mfe|7fc| +M;|[a;fe„_i,a;]| +M^|[a;,a;feJ| + ^ Mk\Ik\ 

k=l k=ko+l 
n 

<^Mfc|7fc| 

The argument given above shows that the theorem holds if Q has one more 
point that P. Using induction, this same technique also shows that the theorem 
holds when Q has an arbitrarily larger number of points than P. □ 

The main lesson to be learned from Theorem 8.3.1 is that refining a parti- 
tion causes the lower Darboux sum to increase and the upper Darboux sum to 
decrease. Moreover, if P,Q G part([a, 6]) and / : [a,b] [—B,B], then, 

m, P) < m, P^Q)< ^(/, P u g) < g). 

Therefore every Darboux lower sum is less than or equal to every Darboux upper 
sum. Consider the following definition with this in mind. 

Definition 8.3.2. The upper and lower Darboux integrals of a bounded function 
/ : [o, 6] M are 

©(/) = gib {©(/, P) : P e part([a, b])} 
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and 

©(/) = lub {©(/, P):Pe part([a, &])}, 

respectively. 

As a consequence of the observations preceding the definition, it follows 
that D(/) > always. In the case = ©(/), the function is said to 

bo Darboux integrable on [a,b], and the common value is written 1){f). The 
following is obvious. 

Corollary 8.3.2. A bounded function f : [a, 6] — )■ M is Darboux integrable if and 
only if for all e > 0 there is a P G part([a, b]) such that D{f, P) — ']^{f, P) < s. 

Which functions are Darboux integrable? The following corollary gives a 
first approximation to an answer. 

Corollary 8.3.3. /// G C([a,6]), then D{f) exists. 

Proof. Let e > 0. According to Corollary 6.6.3, / is uniformly continuous, 
so there is a (5 > 0 such that whenever x,y € [a,b] with |a; — yj < S, then 
|/(x)-/(y)| < e/{b~a). Let P e part([a,6]) with < S. By Corollary 6.5.4, 
in each subintcrval li determined by P, there are x* , y* G I-i such that 

f{x*) = gib {fix) -.xGli} and f{y*) = lub {f{x) : x € li}. 

Since \x* - y*\ < < 5, we see 0 < f{x*) - f{y*) < e/{b - a), for 1 < i < n. 
Then 

S(/)-D(/)<D(/,^')-m^) 

= f^/«)|7,|-f^/(y*)|7,| 

i=l i=l 

n 

= Y.{f{x*)-f{y:)m 

i=l 

n 
i=l 

= e 

and the corollary follows. □ 

This corollary should not be construed to imply that only continuous func- 
tions are Darboux integrable. In fact, the set of integrable functions is much 
more extensive than only the continuous functions. Consider the following ex- 
ample. 

Example 8.3.1. Let / be the salt and pepper function of Example 6.3.6. It 
was shown that C(/) = Q'^. We claim that / is Darboux integrable over any 
compact interval [o, h] . 
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To see this, let e > 0 and A'' e N so that l/N < e/2{h - a). Let 
{qki : I < i < m} = {qk : I < k < N} r\[a,b] 
and choose P G part([a, 6]) such that ||P|| < e/2m. Then 

n 

'5{f,P)=Y,^nh{f{x):xeh)\h\ 

= ^^b{f{x):xGle}\I(\+ lvih{f{x):xGle}\h\ 

< ^{b-a)+m\\P\\ 

< irr, — a)+m- — 

2{b-ay ' 2m 

= e. 

Since f{x) = 0 whenever x e it follows that '^{f,P) = 0. Therefore, 
= W) = 0 and D{f) = 0. 

8.4 The Integral 

There are now two different definitions for the integral. It would be embarass- 
ing, if they gave different answers. The following theorem shows they're really 
different sides of the same coin.^ 

Theorem 8.4.1. Let f : [a, b] R. 

(a) 3?(/) exists iffD{f) exists. 

(b) If^if) exists, then5l{f) =D(/). 

Proof, (a) (=>) Suppose 3?(/) exists and £ > 0. By Theorem 8.2.2, / is 
bounded. Choose P G part([a, 6]) such that 

m)-5i{f,P,xl)\<e/4 

for all selections x% from P. From each Ik, choose Xk and so that 

S £ 

Mk - f{xk) < 7 and f{x^) - ruk < 



4{b-a) " 4(6 -a) 



^Theorem 8.4.1 shows that the two integrals presented here are the same. But, there 
arc many other integrals, and not all of them are equivalent. For example, the well-known 
Lebesgue integral includes all Riemann integrable functions, but not all Lebesgue integrable 
functions are Riemann integrable. The Denjoy integral is another extension of the Riemann 
integral which is not the same as the Lebesgue integral. For more discussion of this, see [9] . 
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Then 



n n 



fc=l /s=l 



n 



Y^{Mk-Xk)\Ik\ 



fe=i 



< 



4(63^^^-") = 4- 



In the same way, 



3l(/,P,ar,)-D(/,P)<£/4. 



Therefore, 



- ©(/) = gib {©(/, Q) : Q e part([a, 6])} - lub {©(/, Q) : Q e part([a, 6])} 
<D(/,P)-^(/,P) 
<(3l(/,P,5f,) + |)-(3^(/,P,a;,)-|) 

< |3?(/,P,Xfe)-3l(/,P,a;fc)| + | 

< |3?(/, P, aJfe) - ai(/) I + |Di(/) - P, + I 



Since e is an arbitrary positive number, this shows that D{f) exists and equals 
3^(/), which is part (b) of the theorem. 

{<=) Suppose / : [a, h] [—B, B], D{f) exists and £ > 0. Since D{f) exists, 
there is a Pi = {ft : 0 < i < m} £ part ([a, 6]) such that 



Set 5 = e/8mB. Choose P € part([a,6]) with ||P|| < 5 and let P2 = P U Pi. 
Since Pi <C P2, according to Theorem 8.3.1, 



Thinking of P as the generic partition, notice that if {xi-i,Xi) fl Pi = 0, then 
©(/, P) and P2) share a common term, Mi|/i|. There are at most m — 
1 instances where (xi_i,Xi) n Pi 7^ 0 and each such intersection generates a 
subinterval of P2 with length less than 6. Therefore, 



< £ 



D{f,Pi)-m,Pi)<l- 



D{f.P2)-nLP2)<l- 



'D{f,P)-'D{f,P2)<{m 




In the same way, 



D(/,P2)-D(/,P)<(m 
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Putting these estimates together yields 
D(/,P)-D(/,P) = 

p) - ©(/, P2)) + (©(/, P2) - Mf, P2)) + mf, p) - m, p)) 

e e e 

This shows that, given £ > 0, there is a (5 > 0 so that < 5 implies 

'5{f,P)-V{f,P)<e. 

Since 

©(/, P) < D{f) < P) and P) < P,<)< P) 

for every selection x* from P, it follows that 'R{f,P,x*) — T){f) < e when 
||P|| < 6. We conclude / is Riemann integrable and = □ 

Prom Theorem 8.4.1, we are justified in using a single notation for both 
and 2)(/). The obvious choice is the familiar f{x) dx, or, more simply, f. 

Example 8.4.1. If / is the salt and pepper function of Example 6.3.6, then 

/ = 0 for any interval [a, b]. 

To see this let e > 0. There is a finite set {qk^ , g/cj , • • • , Qkn } C Q consisting 
of all the rational numbers in [a, b] where f{qkj) > £/2(6 — a). 

8.5 The Cauchy Criterion 

We now face a conundrum. In order to show that /^^ / exists, we must know 
its value. It's often very hard to determine; the; value of an integral, even if 
the integral exists. We've faced this same situation before with sequences. The 
basic definition of convergence for a sequence. Definition 3.1.2, requires the limit 
of the sequence be known. The path out of the dilemma in the case of sequences 
was the Cauchy criterion for convergence. Theorem 3.6.1. The solution is the 
same here, with a Cauchy criterion for the existence of the integral. 

Theorem 8.5.1 (Cauchy Criterion). Let f : [a,b] — )■ M. The following state- 
ments are equivalent. 

(a) f exists. 

(h) Given e > 0 there exists P G part([a, 6]) such that if P <C Qi and P <C Q2, 
then 

\Ji{f,Qi,xl)-Ji{f,Q2,yl)\<e (8.2) 
for any selections from Qi and Q2- 
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Proof. (=^) Assume J f exists. According to Definition 8.2.1, there is a 5 > 0 
such that whenever P e part([a,6]) with ||P|| < 6, then \Ji{f,P,x*)\ < e/2 for 
every selection. If P <^Qi and P <C Q2, then \\Qi \\ < 6, \\Q2\\ < 6 and a simple 
application of the triangle inequality shows 



|3?(/,Qi,4)-3?(/,Q2,2/DI < 



J a J a 



< £. 



(-^=) Let e > 0 and choose P G part([a, 6]) satisfying (8.2) with e/2 in place 
of e. 

We first claim that / is bounded. To see this, suppose it is not. Then 
it must be unbounded on an interval I^^^ determined by P. Fix a selection 
{xk & Ik '■ ^ ^ k < n} and let = Xk for k ^ ko with any element of Iko- 
Then 

I > m,P,xl) - m,P,yl)\ = \f{xkj - fivkoW |4ol • 

But, the right-hand side can be made bigger than e/2 with an appropriate choice 
of yko because of the assumption that / is unbounded on Ik^ . This contradiction 
forces the conclusion that / is bounded. 

Thinking of P as the generic partition and using and as usual with 
Darboux sums, for each fc, choose x'^,y^ € Ik such that 

With these selections, 

n 

P) - P) = Y.^Mk - mk)\Ik\ 
fe=l 

n 

< ^ {\Mk - f{xl)\ + \f{xl) - m)\ + \m) - mk\) 141 
fe=l 

< E (5^ + 1/(4) -/«)! + W 

< I + w/, p. ) - »(/. p. 9^)1 < f 

Corollary 8.3.2 implies !)(/) exists and Theorem 8.4.1 finishes the proof. □ 

Corollary 8.5.2. // /^^ / exists and [c,d] C [a,b], then f exists. 

Proof. Let Pq = {a, b, c, d} e part([a, b]) and e > 0. Choose a partition P^ such 
that Po < Pe and whenever P^ P and P,. ^ P' , then 

m,P,x*k)-5i{f,P',y*k)\<e. 
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Let P} e part([a, c]), e part([c, d]) and P^ G part([rf, &]) so that = 
U P2 U P3. Suppose P2 < Qi and P^ < Qa- Then P^ U Qi U P| for i = 1, 2 
are refinements of P^ and 

|3^(/,Qi,4)-3^(/,Q2,,KI = 

P,i U Qi U P|, 4) - Pi U Q2 U P,^ 4)1 < e 

An apphcation of (8.5.1) shows /^/ exists. □ 

8.6 Properties of the Integral 

Theorem 8.6.1. If f both exist, then 

(a) Ifa,p€ R, then J^{af + pg) exists and f^{af + pg) = aj^f + p g. 

W fa fa (exists. 

(c) /a I /I exists. 

Proof, (a) Let e > 0. If a = 0, in light of Example 8.2.1, it is clear af is 
integrablc. So, assume a 7^ 0, and choose a partition P/ e part([a, b]) such that 
whenever Pf <^ P, then 



/ 



< 



2\a\ 



Then 



:R{af,P,xl)-a / / - Ea/(4)|7fe|-a / / 

Ja Ja 

n .b 

E/(4)I4|- / / 
fe=l -^^ 

5l{f,P,xl)- f f 

J a 



= \ce\ 



< a 



'2\a\ 



£ 

2" 



This shows af is integrable and /^^ af = a J^^ f. 

Assuming /? 7^ 0, in the same way, we can choose a Pg € part([a, 6]) such 
that when Pg <^ P, then 



3^(5, P, 4)- tg 

J a 



< 
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Let Pe = PfLlPg be the common refinement of P/ and Pg, and suppose P^ <^ P. 
Then 



\5i{af + ^g,P,xl)-la [ f + P I 

\ Ja J a 

<|ap(/,P,4)- f f\ + \mi9,P,xl)- f g\ 

J a J a 



This shows a/ + Qg is integrable and (a/ + fig) = a f + P g. 
(b) Claim: If /i exists, then so does jj^ 

To see this, suppose first that 0 < h{x) < M on [a, 6] . If M = 0, the claim 
is trivially true, so suppose M > 0. Let £ > 0 and choose P G part([a, 6]) such 
that 

^{h,P)-Mh,P)<^. 

For each 1 < /c < n, lot 

mfe = glh{h{x) : x € /fc} < \ub{h{x) : x e Ik} = Mi~. 

Since /i > 0, 

ml = glh{h{xf :xGlk}< luh{h{xf : x G h} = M^. 
Using this, we see 



fc=l 

n 

= ^(Mfe + mk){Mk - mk)\Ik\ 

fe=i 

< 2M (fliMk - mk)\Ik\^ 

= 2Af -D(/i,P)) 

< £. 

Therefore, h"^ is integrable when h > 0. 

If /i is not nonnegativc, let m = glb{/i(x) : a < x < b}. Then h — m > 0, 
and h — m is integrable by (a). From the claim, {h — m)^ is integrable. Since 

h"^ = (h- mf + 2mh - m^, 

it follows from (a) that h'^ is integrable. 

Finally, fg = |((/ + g)"^ — (/ — g)^) is integrable by the claim and (a), 
(c) Claim: If /i > 0 is integrable, then so is 
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To see this, let e > 0 and choose P G part ([a, b]) such that 
'D{h,P)-^{h,P) <e^. 

For each 1 < A; < n, let 

ruk = g\h{\/h{x) : X e Ik} < lvih{\/h{x) : x e h} = Mk- 

and define 

A = {k : Mk - mk < e} and B = {k : Mk - ruk > e}. 

Then 

J^{Mk-mk)\Ik\<e{b-a). (8.3) 

Using the fact that > 0, we see that Mk — ruk < Mk + mk, and 

J2(.Mk - mk)\Ik\ < ^Y^iMk + mk){Mk - mk)\Ik\ (8.4) 
fees keB 

= lj2(Mk-ml)\Ik\ 

keB 

<^{V{h,P)-m,P)) 
< e 

Combining (8.3) and (8.4), it follows that 

T5{y/h, P) - 1)(\//^, P)<e{b^a) + e = e((6 - a) + 1) 

can be made arbitrarily small. Therefore, ^/h\s integrable. 

Since |/| = \/~P an application of (b) and the claim suffice to prove (c). □ 

Theorem 8.6.2. If f exists, then 

(a) Iff>Oon [a,b], then jj^ f > 0. 

(b) I /:/!</: I/I 

(c) Ifa<c< b, then f = J^f + J^f. 

Proof, (a) Since all the Riemann sums are nonnegative, this follows at once, 
(b) It is always true that |/|±/ > 0 and \f\-f > 0, so by (a), J^{\f\+f) > 0 
/a (I/I - /) ^ 0- Rearranging these shows - f < \f\ and f < \f\. 
Therefore, | f\ < |/|, which is (b). 
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(c) By Corollary 8.5.2, all the integrals exist. Let e > 0 and choose Pi e 
part([a,c]) and Pr S part([c, 6]) such that whenever Pi <C Qi and Pr <C Qr, 
then, 



,Qi,xl) - / 

J a 



< - and 



m,Qr,y*k)- I f 

J c 



< 



If P = PiUPr and Q = QiU Qr, then P,Q € part([a,6]) and P <^ Q. The 
triangle inequality gives 



3^(/,Q,4)- / /- / f <e. 

J a J c 

Since every refinement of P has the form QiUQr, part (c) follows. □ 

8.7 The Fundamental Theorem of Calculus 

Theorem 8.7.1 (Fundamental Theorem of Calculus 1). Suppose f, F : [a, b] — )• 

M satisfy 

(a) f exists 
(h) F €C{[a,b])f^D{{a,h)) 
(c) F'{x)^f{x), Vx€(a,6) 
Thenj;^f = F{b)-F{a). 

Proof. Let £ > 0 and choose P^ G part ([a, b]) such that whenever P^ <C P, then 

^f,P,xl)- [ f <e. 

J a 

On each interval [xk-\,Xk\ determined by P, the function F satisfies the condi- 
tions of the Mean Value Theorem. (See Corollary 7.4.3.) Therefore, for each k, 
there is an x^. e {xk-i,Xk) such that F{xk) — F{xk-i) = F'{xl){xk - Xk-i) = 

m)\ik\. So, 



/ - im - F{a) 



f-J2iFi^k)-Fixk-i) 
fc=i 

/-^/(4)|4| 
fe=i 

f f-nf,P,xl) 

J a 



< e 



and the theorem follows. 



□ 
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Corollary 8.7.2 (Integration by Parts). If f,g G C{[a,b]) D D{{a,b)) and both 
f'g and fg' are integrable on [a,b], then 

f f9'+ f r9 = mg{b)-f{a)g{a). 

J a J a 



Proof. Use Theorems 7.2.1(c) and 8.7.1. □ 

Example 8.7.1. Suppose / and its first n derivatives are all continuous on [a, 6]. 
There is a function Rn{x,t) such that 

f{x) = J2^^i^-i)' + ^n{x,t) 
k=0 

ioT a < t < b. Differentiate both sides of the equation with respect to t to get 

-R (X t) - _ (^~^)"~^ f(n)(.) 

Using Theorem 8.7.1 gives 

Rn{x, C) = Rn{x, C) - Rn{x, x) 

= I ^Rn{x,t)dt 

which is the integral form of the remainder from Taylor's formiila. 

Suppose f exists. By Corollary 8.5.2, / is integrable on every interval 
[a,x], for X e [a,b]. This allows us to define a function F : [a,6] — )• R as 
F{x) = f, called the indefinite integral of / on [a, b]. 

Theorem 8.7.3 (Fundamental Theorem of Calculus 2). Let f be integrable on 
[a,b] and F be the indefinite integral of f. Then F e C{[a,b]) and F'{x) = f{x) 
whenever x G C{f) n (a, b). 



Proof. To show F e C{[a,b]), let xq e [a, 6] and £ > 0. Since jj"/ exists, 
there is an M > lub{|/(x)| : a < x < b). Choose 0 < ^ < e/M and x e 
{xq — (5, + (5) n [a, b\. Then 



\F{x)-F{xo)\ 
and xo € C{F). 



f 



< Mix - xo\ < MS < e 
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Let Xf) S C(/) n (a, b) and e > 0. There is a 5 > 0 such that x £ {xo — S, xq + 
6) C (a, b) implies \ f{x) - f{xo)\ < e. E 0 < h < S, then 



F{xo + h)- F{xo) 



h 



< 



< 



xa + h 

Xo 

xo + h 

Xo 

xo + h 



{f{t)-f{xo))dt 



\f{t)~ f{xo)\dt 



Xq 

Xa + h 



edt 



This shows i^^(a;o) — /{xq). It can be shown in the same way that F^{xq) = 
f{xQ). Therefore F'{xo) = f{xo). □ 

The right picture makes Theorem 8.7.3 almost obvious. Suppose x £ C{f) 
and e > 0. There is a 5 > 0 such that 

!{{x -d,x + d)n [a, b]) C if{x) - e/2, f{x) + e/2). 

Let 

m = gib {fy ■.\x-y\<5}< lub {fy ■.\x~y\<5} = M. 




X + h 



Apparently M ~ m < e and for 0 < ft- < (5, 
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Since M — m^Oas/i^O, a "squeezing" argument shows 

hio h ' 

A similar argument establishes the limit from the left and F'{x) = f{x). 

It's easy to read too much into the Fundamental Theorem of Calculus. We 
are tempted to start thinking of integration and diffcrcntian as oppositcs of 
each other. But, this is far from the truth. The operations of integration 
and antidifferentiation are different operations, that happen to sometimes be 
tied together by the Fundamental Theorem of Calculus. Consider the following 
examples. 

Example 8.7.2. Let 

\x\/x, X ^0 
0, a; = 0 



It's easy to prove that / is intcgrablc over any compact interval, and that 
F{x) = J^^ / = |a;| — 1 is an indefinite integral of /. But, F is not differentiable 
at a; = 0 and / is not a derivative, according to Theorem 7.4.6. 

Example 8.7.3. Let 

x^sin^, x^O 
0, X = 0 



m 

It's straightforward to show that / is differentiable and 

2x sin ^ — - cos ^ , x ^ 0 



0, a; = 0 



Since /' is unbounded near x = 0, it follows from Theorem 8.2.2 that /' is not 
integrable over any interval containing 0. 

Example 8.7.4. Let / be the salt and pepper function of Example 6.3.6. It was 

shown in Example 8.4.1 that ./ = 0 on any interval [a,b]. If F{x) = Jq f, 
then F{x) = 0 for ah x and F' = f on C{f) = Q". 

8.8 Integral Mean Value Theorems 

Theorem 8.8.1. Suppose f,g : [a,b] — >■ M are such that 

(a) g{x) > 0 on [a, b], 

(b) f is bounded and m < /(x) < M for all x G [a,b], and 

(c) f and fg both exist. 
There is a c & [m, M] such that 

rb rb 



ro no 

/ f9 = c g. 

J a J a 
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Proof. Obviously, 



m [ g< [ f9<M I g. (8.5) 

J a J a 



If = 0, we're done. Otherwise, let 

La 

Then fg = cj^ g and from (8.5), it follows that m < c< M. □ 

Corollary 8.8.2. Let f and g be as in Theorem 8.8.1, hut additionally assume 
f is continuous. Then there is a c € (a, b) such that 



rb rb 

/ fg = /(c) / g. 



Proof. This follows from Theorem 8.8.1 and Corollaries 6.5.4 and 6.5.7. □ 
Theorem 8.8.3. Suppose f, 5 : [o, 6] — )• R are such that 
(a) g{x) > 0 on [a, 6], 

(h) f is hounded and m < f{x) < M for all x G [a, b], and 
(c) f^f and fg both exist. 
There is a c £ [a, b] such that 

rb PC pb 

/ fg = m g + M g. 

Ja Ja Jc 

Proof. For a < x < 6 let 

G{x) =m j g + M I g. 
By Theorem 8.7.3, G e C{[a,b\) and 

ph pb pb 

g\hG<G{h)=m g< fg<M g = G{a)<luhG. 

J a J a J a 

Now, apply Corollary 6.5.7 to find c where G(c) = fg. □ 
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Chapter 9 

Sequences of Functions 



9.1 Pointwise Convergence 

We have accumulated much experience working with sequences of numbers. The 
next level of complexity is sequences of functions. This chapter explores several 
ways that sequences of functions can converge to another function. The basic 
starting point is contained in the following definitions. 

Definition 9.1.1. Suppose 5 C M and for each n G N there is a function 
/„ : 5 ^ R. The collection {/„ : n e N} is a sequence of functions defined on S. 

For each fixed x G S, fn{x) is a sequence of numbers, and it makes sense to 
ask whether this sequence converges. If fn{x) converges for each x G S, a new 
function / : 5 — > M is defined by 

f{x) = lim fn{x). 



The function / is called the pointwise limit of the sequence /„, or, equivalently, 
it is said /„ converges pointwise to f. This is abbreviated /„ — >f, or simply 
fn f, if the domain is clear from the context. 

Example 9.1.1. Let 

'O, x<0 

fn{x) 



0 < a; < 1 . 
a; > 1 



Then fn^f where 




(See Figure 9.1.) This example shows that a pointwise limit of continuous func- 
tions need not be continuous. 
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Figure 9.2: The first four functions from the sequence of Example 9.1.2. 



Example 9.1.2. For each n G N, define /„ 



by 



fnix) = 



1 + n^x 



(See Figure 9.2.) Clearly, each /„ is an odd function and lim|j.|_j.oo fn{x) = 0. A 
bit of calculus shows that /„(l/n) = 1/2 and fn{—l/n) = —1/2 are the extreme 
values of /„. Finally, if a; 7^ 0, 



\fnix)\ 



nx 
1 + ri^x^ 



< 



1 

nx 



implies /„ — > 0. This example shows that functions can remain bounded away 
from 0 and still converge pointwise to 0. 
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60 - 
50 - 
40 - 
30 - 
20 - 



0.2 



0.4 



0.6 



0.8 



1.0 



Figure 9.3: The first four functions from the sequence of Example 9.1.3. 



Example 9.1.3. Define /„ 



fnix) 



by 



0, 



2"+i 



< X < 



2T1+2 < X < 

otherwise 



To figure out what this looks like, it might help to look at Figure 9.3. 

The graph of /„ is a piecewise linear function supported on [1/2"+^, 1/2"] 
and the area under the isoceles triangle of the graph over this interval is 1. 
Therefore, fn = ^ for all n. 

If a; > 0, then whenever x > 1/2", we have fn{x) — 0. From this it follows 
that /„ 0. 

The lesson to be learned from this example is that it may not be true that 



limr. 



Jq fn — Jq lillln— foo fn 



Example 9.1.4. Define /„ 



by 



fnix) = 




X < ^ 

\x\ > h 



(See Figure 9.4.) The parabolic section in the center was chosen so /„(±l/7i) = 
1/n and /4(±l/n) = ±1. This splices the sections together at (±l/n, ±l/n) so 
/„ is differentiable everywhere. It's clear /„ — >■ \x\, which is not differentiable 
at 0. 

This example shows that the limit of differentiable functions need not be 
differentiable. 

The examples given above show that continuity, integrability and differen- 
tiability are not preserved in the pointwise limit of a sequence of functions. To 
have any hope of preserving these properties, a stronger form of convergence is 
needed. 
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-1.0 



Figure 9.4: The first ten functions / from Example 9.1.4. 

9.2 Uniform Convergence 

Definition 9.2.1. Tlie sequence f„-S^M. converges uniformly to f : S ^ R 
on S, if for eacli e > 0 tliere is an iV G N so tliat whenever n > N and x G S, 
then \fn{x) - f{x)\ < e. 

In this case, we write /„ f, or simply /„ ^ /, if the set S is clear from 
the context. 




Figure 9.5: |/n(x) — f{x)\ < e on [a,b], as in Definition 9.2.1. 



The difference between pointwise and uniform convergence is that with point- 
wise convergence, the convergence of /„ to / can vary in speed at each point of 
S. With uniform convergence, the speed of convergence is roughly the same all 
across S. Uniform convergence is a stronger condition to place on the sequence 
/„ than pointwise convergence in the sense of the following theorem. 

Theorem 9.2.1. ///„ ^ /, then /„^/. 
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Proof. Let xq € S* and e > 0. There is an iV € N such that when n > N , 
then \ f{x) — fn{x)\ < s for ah x £ S. In particular, \f{xo) — fn{xo)\ < e when 
n > N. This shows fn{xo) f{xo)- Since xq & S is arbitrary, it follows that 



The first three examples given above show the converse to Theorem 9.2.1 
is false. There is, however, one interesting and useful case in which a partial 
converse is true. 

Definition 9.2.2. If /„ — >f and t f{x) for all x G S, then /„ increases 

to / on S. If /„ — >f and fn{x) i f{x) for all x G S, then /„ decreases to f on 
S. In either case, /„ is said to converge to / monotonically. 

The functions of Example 9.1.4 decrease to \x\. Notice that in this case, 
the convergence is also happens to be uniform. The following theorem shows 
Example 9.1.4 to be an instance of a more general phenomenon. 

Theorem 9.2.2 (Dini's Theorem). If 

(a) S is compact, 

(b) /„ — >f monotonically, 

(c) /„ e C{S) for all n e N, and 

(d) / e C{S), 
then fn ^ f. 

Proof. There is no loss of generality in assuming fn i f, for otherwise we con- 
sider — /„ and — /. With this assumption, if gn = fn — f, then gn is a sequence 
of continuous functions decreasing to 0. It suffices to show <?„ ^ 0. 

To do so, let £ > 0. Using continuity and pointwise convergence, for each 
X G S find an open set containing x and an G N such that gN^iy) < £ 
for all y G Gx. Notice that the monotonicity condition guarantees gn{y) < £ for 
every y G Gx and n> Nx. 

The collection {Gx : x G S} is an open cover for 5*, so it must contain a finite 
subcovcr {Gxi : 1 < i < n}. Let N — max{Nx- : 1 < i < n} and choose m > N. 
If X G S, then x G Gx^ for some i, and 0 < gm{x) < gnix) < gNi{x) < s. It 
follows that gn ^ 0. □ 



9.3 Metric Properties of Uniform Convergence 

If 5 C M, let B{S) = {f : S ^ R : f is bounded}. For / e B{S), define 
= lub{|/(a;)| : x G S}. (It is abbreviated to ||/||, if the domain S is clear 
from the context.) Apparently, ||/|| > 0, ||/|| = 0 <;=^ / = 0 and, if 5 e B{S), 
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then 11/ - g\\ = \\g - f\\. Moreover, if /i e B{S), then 

\\f-g\\=luh{\f{x)-g{x)\:xGS} 

< lub{|/(a;) - h{x)\ + \h{x) - g{x)\ : x e S} 
<luh{\f{x)^h(x)\ ■.xGS}+\uh{\h{x)~g{x)\ : x G S} 

= \\.f~h\\ + \\h-g\\ 

Combining all this, it follows that ||/ — g\\ is a metric^ on B{S). 

The definition of uniform convergence implies that for a sequence of bounded 
functions /„ : 5 ^ K, 

fn^f^ Wfn - /II ^ 0. 

Because of this, the metric ||/ — is often called the uniform metric or the sup- 
metric. Many ideas developed using the metric properties of M can be carried 
over into this setting. In particular, there is a Cauchy criterion for uniform 
convergence. 

Definition 9.3.1. Let 5 C M. A sequence of functions /„ : /S — )• K is a Cauchy 
sequence under the uniform metric, if given s > 0, there is an A'' S N such that 
when m,n > N , then ||/„ — fm\\ < e. 

Theorem 9.3.1. Let fn & B{S). There is a function f G B{S) such that 
fn ^ / iff fn is a Cauchy sequence in B{S). 

Proof (^) Let /„ ^ / and £ > 0. There is an AT e N such that n > N 
implies ||/„ - /|| < s/2. If m > A and n > A, then 

ll/m - /nil < ll/m - /|| + II / - /n || < | + | = £ 

shows /„ is a Cauchy sequence. 

{<=) Suppose /„ is a Cauchy sequence in B{S) and £ > 0. Choose A e N 

so that when \\fm — fn\\ < £ whenever m > N and n > N. In particular, for 
a fixed xq G S and m,n > N, \fm{xo) - fn{xo)\ < \\fm - fn\\ < £ shows the 
sequence fn{xo) is a Cauchy sequence in M and therefore converges. Since xq is 
an arbitrary point of S*, this defines an / : S' — )■ R such that /„ — >■ /. 

Finally, if m, n > A and x G S the fact that |/„(a;) — /„(a;)| < £ gives 

l/n(a;) - /(a;)| = lim \fnix) - fm{x)\ < e. 

This shows that when n > N, then ||/„ — /|| < e. We conclude that / S B{S) 
and fn ^ f. □ 

A collection of functions S is said to be complete under uniform convergence, 
if every Cauchy sequence in § converges to a function in S. Theorem 9.3.1 shows 
B{S) is complete under uniform convergence. We'll see several other collections 
of functions that are complete under uniform convergence. 

^Definition 2.2.3 
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9.4 Series of Functions 

The definitions of pointwise and uniform convergence are extended in the natural 
way to series of functions. If J2T=i fk is a series of functions defined on a 
set S, then the series converges pointwise or uniformly, depending on whether 
the sequence of partial sums, s„ = J2k=i fk converges pointwise or uniformly, 
respectively. It is absolutely convergent or absolutely uniformly convergent, if 
J2^=i \fn\ is convergent or uniformly convergent on S, respectively. 
The following theorem is obvious and its proof is left to the reader. 

Theorem 9.4.1. Let fn a series of functions defined on S . IfY^'^=i fn 

is absolutely convergent, then it is convergent. If fn is absolutely uni- 

formly convergent, then it is uniformly convergent. 

The following theorem is a restatement of Theorem 9.2.2 for series. 

Theorem 9.4.2. If fn is a series of nonnegative continuous functions 

converging pointwise to a continuous function on a compact set S, then Y^'^=i fn 
converges uniformly on S. 

A simple, but powerful technique for showing uniform convergence of series 
is the following. 

Theorem 9.4.3 (Wcicrstrass M-Tcst). ///„ : S ^M. is a sequence of functions 
and M„ is a sequence nonnegative numbers such that \\fn\\s < -^n for all n €N 
and X^^i Mn converges, then J2'^=i fn is absolutely uniformly convergent. 

Proof. Let e > 0 and s„ be the sequence of partial sums of Yl'^=i \fn\- There is 
an N such that when n>m> N, then < s. So, 



< 



;=m+l 



n 



< E IIMI< E^^^<^- 



fe=m+l fe= 



This shows s„ is a Cauchy sequence in B{S) and must converge according to 
Theorem 9.3.1. □ 



9.5 Continuity and Uniform Convergence 

Theorem 9.5.1. // /„ : 5 — > M such that each fn is continuous at xq and 
then f is continuous at xq. 

Proof. Let £ > 0. Since /„ \ /, there is an A'' e N such that whenever n> N 
and X £ S, then \fn(x) — f{x)\ < e/3. Because /at is continuous at Xq, there is 
a (5 > 0 such that x € {xq — d,xo + 6)riS implies |/Ar(a;) — /Ar(a;o)| < e/3. Using 
these two estimates, it follows that when x G {xq — 6,xo + S) (1 S, 

\f{x) - f{xo)\ = \f{x) - fN{x) + fN{x) - fN{X0) + fN{Xo) - f{xo)\ 

< \f{x) - fN{x)\ + If^ix) - fNixo)\ + l/jvK) - /(a;o)| 
<e/3 + £/3 + e/3 = s. 

Therefore, / is continuous at xq. □ 
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The following corollary is immediate from Theorem 9.5.1. 

Corollary 9.5.2. // /„ is a sequence of continuous functions converging uni- 
formly to f on S , then f is continuous. 

Example 9.1.1 shows that continuity is not preserved under pointwise con- 
vergence. Corollary 9.5.2 establishes that if S' C M, then C{S) is complete under 
the uniform metric. 

The fact that C([a, b]) is closed under uniform convergence is often useful be- 
cause, given a "bad" function / g C([a,6]), it's often possible to find a sequence 
/„ of "good" functions in C{[a,b]) converging uniformly to /. 

Theorem 9.5.3 (Weierstrass Approximation Theorem). If f ^ C{[a^b]), then 
there is a sequence of polynomials Pn ^ /. 

To prove this theorem, we first need a lemma. 



Lemma 9.5.4. For n ^ N let c, 



■n 




and 




1^1 <1 
\t\>l 



(See Figure 9.6.) Then 



(a) kn{t) >0 on [-1, 1] for all n e N; 



(b) J^^kn — 1 for all n G N; and, 



(c) if0<5< 1, then k, 



n 



0 on [-1,-5] U [5,1]. 



-1.0 




-0.5 



0.5 



1.0 



Figure 9.6: Here are the graphs of fc„(i) for n = 1,2, 3, 4, 5. 
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Proof. Parts (a) and (b) follow easily from the definition of A;„. 
To prove (c) first note that 

/I fi/Vn 2 / 1 

kn > / C„(l - > C„— 1 - - 

-1 i-i/v^ Vn\ n 



Since (l — t i, it follow that there is an a > 0 such that c„ < a^n? 
Letting 8 G (0, 1) and (5 < f < 1, 

A;n(i) < K{S) < a^/n{l - <5=^)" ^ 0 

by L'Hospital's Rule. Since kn is an even function, this establishes (c). □ 

A sequence of functions satisfying conditions such as those in Lemma 9.5.4 

is called a convolution kernel or a Dirac sequence."^ Several such kernels play 
a key role in the study of Fourier series, as we will see in Theorems 10.3.1 and 
10.5.1. 

We now turn to the proof of the theorem. 

Proof. There is no generality lost in assuming [a,b] = [0,1], for otherwise we 
consider the linear change of variables g{x) = f{{b — a)x + a). Similarly, we 
can assume /(O) = /(I) = 0, for otherwise we consider g{x) = f{x) — ((/(I) — 
/(0))a;+/(0), which is a polynomial added to /. We can further assume f{x) = 0 
when X ^ [0, 1]. 
Set 

Pn{x) = j J{X + t)kn{x) dt. (9.1) 

To see p„ is a polynomial, change variables in the integral using u = x + t to 
arrive at 

Pn{x) = I f{u)kn{u-x)du= / f {u)kn{x - u) du, 
Jx-1 Jo 

because f{x) = 0 when x ^ [0, 1]. Notice that kn{x — m) is a polynomial in u 
with coefficients being polynomials in x, so integrating f{u)kn{x — u) yields a 
polynomial in x. (Just try it for a small value of n and a simple function /!) 



It is interesting to note that with a bit of work using complex variables one can prove 
r(n + 3/2) n + 1/2 n - 1/2 n-3/2 3/2 

Cn = —7= — ; = — X — X — X • • • X . 

V7fr(n + 1) n n-1 n-2 1 

^Given two functions / and g defined on R, the convolution of / and g is the integral 

/oo 
f{t)g(x - i) dt. 
-OO 

The term convolution kernel is used because such kernels typically replace g in the convolution 
given above, as can be seen in the proof of the Weierstrass approximation theorem. 
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Use (9.1) and Lemma 9.5.4(b) to see for 6 G (0, 1) that 

|p„(x) - f{x)\ = J J{x + t)K{t) dt - fix) 

1 

ifix + t)- f{x))kn{t) dt 



< J ^\f{x + t)- f{x)\kn{t)dt 



\fix + t)- f{x)\kn{t)dt+ \f{x + t)- f{x)\kn{t)dt. (9.2) 

5 Js<\t\<\ 



We'll handle each of the final integrals in turn. 

Let £ > 0 and use the uniform continuity of / to choose a 5 e (0, 1) such 
that when \t\ < 5, then \f{x + t) - f{x)\ < e/2. Then, using Lemma 9.5.4(b) 
again, 

J^^ \f{x + t)- f{x)\k„{t) dt<^- fc„(f) dt<^- (9.3) 

According to Lemma 9.5.4(c), there is an e N so that when n > N and 
\t\ > S, then kn{t) < g(||y^||_^'^i)(i_g) . Using this, it follows that 



/ \f{x + t)-f{x)\kn{t)dt 

J5<\t\<l 

= j ^ \f{x + t)- f{x)\kn{t)dt + \f{x + t)- f{x)\kn 



{t)dt 



<2|| 



kn{t)dt + 2\\ 



dt 



< ^ii/ii sd/ii/iia-.) " - + ^ii/ii s(ii/ii/i)(i-.) " - - 1 (»^*) 

Combining (9.3) and (9.4), it follows from (9.2) that \Pn{x) — i{x)\ < e for all 
X e [0, 1] and p„ ^ /. □ 

Corollary 9.5.5. If f £ C{[a, b]) and s > 0, then there is a polynomial p such 
that ||/-p||[a,6] < e- 

The theorems of this section can also be used to construct some striking 
examples of functions with unwelcome behavior. Following is perhaps the most 

famous. 



Example 9.5.1. There is a continuous / : 



that is differentiable nowhere. 
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0.5 1.0 1.5 

Figure 9.7: sq, si and S2 from Example 9.5.1. 




Figure 9.8: The nowhere differentiable function / from Example 9.5.1. 



Proof. Thinking of the canonical example of a continuous function that fails to 
be differentiable at a point-the absolute value function-we start with a "saw- 
tooth" function. (See Figure 9.5.) 



X — 2n, 



2n < a; < 2n -fl, n G Z 



2n + 2 - 2n -1- 1 < x < 2n -I- 2, n e Z 



Notice that Sq is continuous and periodic with period 2 and maximum value 1. 
Compress it both vertically and horizontally: 



Sn (4"a;) , n e N. 



Each Sn is continuous and periodic with period p„ = 2/4" and ||s„|| = (3/4)'^ 
Finally, the desired function is 



n=0 



Since ||s„|| = (3/4)", the Weierstrass M-test implies the series defining / is 
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uniformly convergent and Corollary 9.5.2 shows / is continuous on M. We will 
show / is differentiable nowhere. 

Let a; e M, m e N and hm = 1/(2 • 4™). 

If n > m, then hm/Pn = 4"~"*~^ e N, so s„(a; ± hm) — Sn{x) = 0 and 



f{x ± hm) - f{x) 

±hrr,. 



= E 



k=0 



Sk{x±hm) - Skjx) 
±hrr,. 



(9.5) 



On the other hand, if n < m, then a worst-case estimate is that 

Sn{x±hm) - Sn{x) 



This gives 



E 

fe=0 



hr. 



Sk{x ± hm) - Sk{x) 



< 



I 



= 3" 



m— 1 
fe=0 

3™ - 



Sk{x±hm) - Sk{x) 



±hr, 



< 



1 3™ 
3- 1 ^ T 



(9.6) 



Since Sm is linear on intervals of length 4 "* = 2 • hm with slope ±3™ on 
those linear segments, at least one of the following is true: 



Sm{x + hm) - S{x) 



hr, 



= 3™ or 



s{x) 



3™. 



(9.7) 



Suppose the first of these is true. The argument is essentially the same in the 
second case. 

Using (9.5), (9.6) and (9.7), the following estimate ensues 



fix + h,,,:)- fix) 



hr, 



E 

k=0 
m 

E 



hr. 



Sk{x + hm) -Sk{x) 



> 



Sm{x + hm) - S{x) 



m — 1 

E 



k=0 



hjYi 

Since 3'"/2 — >■ oo, it is apparent ]' [x) does not exist. 



Sfc(x ± hm) - Sk(x) 



±hr, 



> 3" 



□ 



There are many other constructions of nowhere differentiable continuous 
functions. The first was published by Weierstrass [16] in 1872, although it was 
known in the folklore sense among mathematicians earlier than this. (There is 
an English translation of Weierstrass' paper in [8].) In fact, it is now known in a 
technical sense that the "typical" continuous function is nowhere differentiable 
[4]. 
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9.6 Integration and Uniform Convergence 

One of the recurring questions with integrals is when it is true that 

lim fn= lim /„. 

n—>oc J J n^oo 

This is often referred to as "passing the limit through the integral." At some 
point in her career, any student of advanced analysis or probability theory will 
be tempted to just blithely pass the limit through. But functions such as those 
of Example 9.1.3 show that some care is needed. A common criterion for doing 
so is uniform convergence. 

Theorem 9.6.1. ///„ : [a, 6] — ^ M such that /„ exists for each n and fn ^ 
/ on [a,b], then 

/ /= hm / /„ 

a a 



Proof. Some care must be taken in this proof, because there are actually two 
things to prove. Before the equality can be shown, it must be proved that / is 
integrable. 

To show that / is integrable, let £ > 0 and N G N such that ||/ — /jv|| < 
£/3(6 - a). If P e part([a, b]), then 

n n 

fe=i fc=i 

= lE(/(4)-/iv(4))l4|| 
fc=i 

<El/(4)-/iv(4)ll4l 

k=l 

n 

£ 

~ 3 

According to Theorem 8.5.1, there is a P e part([a, 6]) such that whenever 
P < Qi and P < Q2, then 

\nfN,Qi,xl)-nfN.Q2,yl)\ < |. (9.9) 
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Combining (9.8) and (9.9) yields 



\0l{f,Q,,xl)-3l{f,Q2,yl)\ 

= \^if, Qi,xl) - 3?(/iv, Qi,xl) + JlifN, Oi, .4) 

-nfN,Qi,xl) + niN.Q2.yk) - m, Q2,y*k)\ 

< |3i(/, Qi,xl) - :k{fN, Qi, 4)1 + Wn.Qu xl) - n.fN, Qi,xl)\ 

+ \5i{fN,Q2,yl)-m, Q2,y*k)\ 
e e e 

<3+3+3=^ 

Another application of Theorem 8.5.1 shows that / is integrable. 
Finally, 











/ / - / In 




f\f - In) 


</ 


J a J a 




J a 


J a 



la 3(6 -a) 3 



shows that /„ f. 



□ 



Corollary 9.6.2. IfY^^^^i /" ^ series of integrable functions converging uni- 
formly on [a,h\, then 



„b 00 00 ,.5 

Jo- „=1 71=1-'° 



Combining Theorem 9.6.1 with Dini's Theorem, gives the following. 

Corollary 9.6.3. If fn is a sequence of continuous functions converging mono- 
tonically to a continuous function f on [a,b], then fn la f- 



9.7 Uniform Convergence and Differentiation 

The relationship between uniform convergence and differentiation is somewhat 
more complex than those we've already examined. First, because there are two 
sequences involved, /„ and /4, either of which may converge or diverge at a 
point; and second, because differentiation is more "delicate" than continuity or 
integration. 

Example 9.1.4 is an explicit example of a sequence of differentiable functions 
converging uniformly to a function which is not differentiable at a point. The 
derivatives of the functions from that example converge pointwise to a function 
that is not a derivative. The Weierstrass Approximation Theorem and Example 
9.5.1 push this to the extreme by showing the existence of a sequence of poly- 
nomials converging uniformly to a continuous nowhere differentiable function. 

The following theorem starts to shed some light on the situation. 

Theorem 9.7.1. If fn is a sequence of derivatives defined on [a,b] and fn ^ 
/, then f is a derivative. Moreover, if the sequence Fn S C{[a,b]) satisfies 
= /„ and F„(a) = 0, then Fn ^ F where F' = f. 
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Proof. For each n, let Fn be an antiderivative of /„. By considering Fn{x) — 
Fn{a), if necessary, there is no generaUty lost with the assumption that Fn{a) = 
0. 

Let e > 0. There is an A/' e N such that 

m,n>N 11/;^ - /„|| < ^. 

If X G [a, b] and m,n > N, then the Mean Value Theorem and the assumption 
that Fm{a) = -F„(a) = 0 yield a c S [a, b] such that 

\Fm{x) - Fn{x)\ = |(F„(,x) - - (F,„(a) - F„(a))| 

= l/m(c) - /„(c)| a| < - /„||(6- a) < e. 

This shows F„ is a Cauchy sequence in C([a, 6]) and there is an F e C([a, 6]) 
with F„ =^ F. 

It suffices to show F' = f. To do this, several estimates are established. 
Let M e N so that 

m,n> M \\fm-fn\\ < |. 

Notice this implies 

11/ - /nil < |, Vn>M. (9.10) 

For such m,n > M and x,y G [a, b] with x ^ y, another application of the 
Mean Value Theorem gives 

Fn{:r) - F„{y) F„,{x) - F,n{y) 
X — y X — y 

\x y\ 

= l/nW - fmic)\ \x-y\< Wfn " fm\\ < ^. 

\x-y\ 3 

Letting m ^ oo, it follows that 

Fn{x)-Fn{y) F{x)-F{y) 
X — y X — y 

Fix n> M and a; G [a, 6]. Since F^{x) = fn{x), there is a (5 > 0 so that 

< |,Vt/e (a;-5,a; + 5)\{a;}. (9.12) 



< -, Vn > M. 
-3 



(9.11) 



Fn{x) - Fnjy) 
x-y 



fnix) 
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Finally, using (9.11), (9.12) and (9.10), we see 
Fix) - F{y) 



x-y 



fix) 



F{x) - F{y) Fn{x) - Fn{y) 



X — y X — y 

^Fr,{x) - Fn{y) 



< 



x-y 

F{x)-F{y) F^{x)-Fn{v) 
X — y X — y 

F^jx) - F^jy) 
x-y 



.fn{x) + fn{x) - f{x) 



+ 



- fn{x) 



+ \fn{x)-f{x)\ 



e e e 
<3 + 3 + 3=^- 



This establishes that 



as desired. 



hm = /(a;), 

y^x x — y 



□ 

g and 



Corollary 9.7.2. If Gn & C{[a,h]) is a sequence such that G'^ 
Gn{xo) converges for some xq € [a,b], then Gn ^ G where G' = g. 

Proof Let F„ and F be as in Theorem 9.7.2. Then F' - G' = ,9„ - ,9„ = 0. so 
Gn = Fn + OLn for some constant a„. In particular, a„ = G„(a;o) — -F„(a;o) —J- a, 
for some a e M, because both G„ and F„ converge at xq. 

Let e > 0. Since both F„ and a„ are Cauchy sequences, there exists an 
A'' e N such that 

£ £ 

m,n> N => \\Fn - Fm\\ < - and |a„ - < -. 

U m,n > N and x e [a, b] , then 

\Gn{x) - Gm{x)\ = \{F„{x) + a„) - {F^{x) + a™)| 
< \Fn{x) - F,nix)\ + \an - ami 



< \\Fn 



+ 2<^- 



This shows G„ is a Cauchy sequence in G{[a,b]) and therefore G„ 
clear that G = F + a, so G" = F' = g. 



G. It is 

□ 



Corollary 9.7.3. ///„ is a sequence of differentiable functions defined on [a,b] 
such that f{xo) exists for some xq e [a, b] and X^^i fn converges uni- 

formly, then 



fe=l 



Proof. Left as an exercise. 



□ 
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9.8 Power Series 

9.8.1 The Radius and Interval of Convergence 

One place where uniform convergence plays a key role is with power series. 
Recall the definition. 

Definition 9.8.1. A power series is a function of the form 

oo 

/(x) = ^a„(x-c)". (9.13) 

71=0 

The domain of / is the set of all x at which the series converges. The constant 
c is called the center of the series. 

To determine the domain of (9.13), let a; € IR \ {c} and use the root test to 

see the series converges when 

limsup \an{x - c)"|^/" = \x - c\ limsup |a„|^/" < 1 
and diverges when 

\x - c\ limsup |a„|^/" > 1. 

If r limsup |a„|^/" < 1 for some r > 0, then these inequalities imply (9.13) is 
absolutely convergent when |a; — c| < r. In other words, if 

i? = lub{r :rlimsup|a„|i/" < 1}, (9.14) 

then the domain of (9.13) is an interval of radius R centered at c. The root test 
gives no information about convergence when |a; — c| = R. This R is called the 
radius of convergence of the power series. Assuming i? > 0, the open interval 
centered at c with radius R is called the interval of convergence. It may be 
different from the domain of the series because the series may converge at one 
endpoint or both endpoints of the interval of convergence. 

The ratio test can also be used to determine the radius of convergence, but, 
as shown in (4.8), it will not work as often as the root test. When it does, 



R = lub {r : r lim sup 



< 1}. (9.15) 



Example 9.8.1. Calling to mind Example 4.1.2, it is apparent the geometric 
power series X^^o center 0, radius of convergence 1 and domain (—1, 1). 

Example 9.8.2. For the power series J2'^=i 2"(a; + 2)"/n, we compute 

2n \ -j^ 



lim sup — =2 =^ R = 
\nj 2 

Since the series diverges when x = — 2± |, it follows that the interval comprising 
its domain is (—5/2,-3/2). 
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Example 9.8.3. The power series /n has interval of convergence (—1, 1) 

and domain [—1,1). Notice it is not absolutely convergent when x = — 1. 

Example 9.8.4. The power series J2^=i x^/n^ has interval of convergence (—1, 1), 
domain [—1, 1] and is absolutely convergent on its whole domain. 

The preceding is summarized in the following theorem. 

Theorem 9.8.1. Let the power series be as in (9.13) and R be given by either 
(9.14) or (9.15). 

(a) If R^O, then the domain of the series is {c}. 

(b) If R > 0 the series converges at x when \c — x\ < R and diverges at x 
when \c — x\ > R. 

(c) If R ^ (0, oo), then the series may converge at none, one or both of c — R 
and c + R. 

9.8.2 Uniform Convergence of Power Series 

The partial sums of a power series are a sequence of polynomials converging 
pointwise on the domain of the series. As has been seen, pointwise convergence 
is not enough to say much about the behavior of the power series. The following 
theorem opens the door to a lot more. 

Theorem 9.8.2. A power series converges absolutely and uniformly on compact 
subsets of its interval of convergence. 

Proof. There is no generality lost in assuming the series has the form of (9.13) 
with c = 0. Let the radius of convergence R > 0 and be a compact sbset of 
{-R,R). Choose r € (lub{|a;| : x e K},R). lixeK, then |a„a:"| < |a„r"| for 
n e N. Since J2^=o k™^"! converges, the Weierstrass M-test shows J2^=o ^nx'^ 
is absolutely and uniformly convergent on K. □ 

The following two corollaries are immediate consequences of Corollary 9.5.2 
and Theorem 9.6.1, respectively. 

Corollary 9.8.3. A power series is continuous on its interval of convergence. 

Corollary 9.8.4. If[a,b] is an interval contained in the interval of convergence 

for the power series X]^o '^nix — c)", then 



The next question is: What about differentiability? 

Notice that the continuity of the exponential function and L'Hospital's Rule 
give 





ji 
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Therefore, for any sequence a„, 

limsup(na„)^/" = lim sup n^/"ay" = limsupay". (9.16) 

Now, suppose the power series X^^o ^n^" has a nontrivial interval of con- 
vergence, /. Formally differentiating the power series term-by-term gives a new 
power series X^^i na„a;"~^. According to (9.16) and Theorem 9.8.1, the term- 
by-term differentiated series has the same interval of convergence as the original. 
Its partial sums are the derivatives of the partial sums of the original series and 
Theorem 9.8.2 guarantees they converge uniformly on any compact subset of /. 
Corollary 9.7.3 shows 

^ oo oo ^ oo 

— a„a;" = —anX^ = na„x"~"^, Va; e /. 
dx ^ ^ ax ^ 

n— 0 n— 0 n— 1 

This process can be continued inductively to obtain the same results for all 
higher order derivatives. We have proved the following theorem. 

Theorem 9.8.5. 7/ f{x) = Y^^o ^n{x — c)" is a power series with nontrivial 
interval of convergence, I, then f is differentiable to all orders on I with 

oo I 
n=m ^ ^ 

Moreover, the differentiated series has I as its interval of convergence. 



9.8.3 Taylor Series 

Suppose f{x) = J2^=o "nx'"' has / ~ {—R, R) as its interval of convergence for 
some R> 0. According to Theorem 9.8.5, 

/(H(o) = am ^ am = P, Vm e a;, 

(m — to)! to! 

Therefore, 

This is a remarkable result! It shows that the values of / on 7 are completely 
determined by its values on any neighborhood of 0. This is summarized in the 
following theorem. 

Theorem 9.8.6. If a power series f{x) = Y^^=o^n{x — c)" has nontrivial 
interval of convergence I, then 

/(^) = E ^-^(^ - Va; e 7. (9.18) 

n! 

n=0 
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The scries (9.18) is called the Taylor series'^ for / centered at c. The Taylor 
series can be formally defined for any function that has derivatives of all orders 
at c, but, as Example 7.5.2 shows, there is no guarantee it will converge to the 
function anywhere except at c. 

9.8.4 The Endpoints of the Interval of Convergence 

We have seen that at the endpoints of its interval of convergence a power series 
may diverge or even absolutely converge. A natural question when it does 
converge is the following: What is the relationship between the value at the 
endpoint and the values inside the interval of convergence? 

Theorem 9.8.7 (Abel). // f{x) = J2'^=o'^n{x — c)" has a finite radius of 
convergence R> 0 and f{c + R) exists, then c + R G C{f). 

Proof. It can be assumed c = 0 and R = 1. There is no loss of generality with 
either of these assumptions because otherwise just replace f{x) with f{{x+c)/R). 
Set s = /(I), s_i = 0 and s„ = Xlfe=o '^fc new. For |a;| < 1, 



'^Ukx'' = ^(sfe - Sk-i)x'' 

k=a 

n n 

n—1 n—1 

k=0 fc=0 

n-1 

Snx"- + (1 - a;) ^ Skx'' 



fe=o 



When n — )• oo, since s„ is bounded, 

•DO 

f{x) = {l-x)Y,Skx''. (9.19) 

fe=0 

Since (1 - x) J2n=o a^" = 1' (9-19) implies 



l/W-«l = , 

fc=0 

Let £ > 0. Choose iV e N such that \sn - s\ < e/2, S G (0, 1) so 

N 

SY,K-s\ <e/2. 

fe=0 



(9.20) 



When c = 0, it is often called the Maclaurin series for /. 
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and 1 — ^ < a; < 1. With these choices, (9.20) becomes 



l/W-s|< 



N 



(l-a;)^(sfc-s)x* 



/s=0 



+ 



k=N+l 



N 



<SY,\sk-s\ + ^ (l-x) ^* 

fe=0 fe=JV+l 

It has been shown that hmj:-|-i f{x) = /(I), so 1 e C(/). 



e £ 
<2+2=^ 



□ 



Abel's theorem opens up a more general idea for the summation of scries. 
Suppose, as in the proof of the theorem, fix) = X]^o '^nx" with interval of 
convergence (—1, 1). If lim2,-|-i f{x) exists, then this limit is called the Abel sum 
of the coefficients of the series. In this case, the following notation is used 



lim/(a;) = A Va„. 

From Theorem 9.8.7, it is clear that when X^^q ^" exists, then X^^q = 
A J2'^=o ■ The converse of this statement may not be true. 
Example 9.8.5. Notice that 



n— 0 



has interval of convergence (—1, 1) and diverges when x = 1. (It is the sum 
1-1 + 1-1 + -- -.) But, 



A V(-l)" =lim-^ = \. 

■H; xni + x 2 



Finally, here is an example showing the power of these techniques. 
Example 9.8.6. The series 



oo 



n=0 



has (—1, 1) as its interval of convergence and (0, 1] as its domain. If 0 < |a;| < 1, 
then Corollary 9.6.2 justifies 



arctan(a;) = 



dt 



0 1 + 



11=0 



n=0 



2n + 1 



This series for the arctangent converges by the alternating series test when x = 1, 
so Theorem 9.8.7 implies 



E 

n=0 



(-1)" T / N 

= limarctania;) = arctan(l) = — . 

2n+l xfi ^ ' ^ ' A 



(9.21) 
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Chapter 10 

Fourier Series 



In the late eighteenth century, it was weh-known that comphcated functions 
could sometimes be approximated by a sequence of polynomials. Some of 
the leading mathematicians at that time, including such luminaries as Daniel 
Bernoulli, Euler and d'Alembert studied the possibility of using sequences of 
trigonometric functions for approximation. In 1807, this idea opened into a 
huge area of research when Joseph Fourier used series of sines and cosines to 
solve several outstanding partial differential equations of physics.^ 
In particular, he used series of the form 

oo 

a„ COS nx + 6„ sin nx 

to approximate his solutions. Series of this form arc called trigonometric series, 
and the ones derived from Fourier's methods are called Fourier series. Much of 
the mathematical research done in the nineteenth and early twentieth century 
was devoted to understanding the convergence of Fourier series. This chapter 
presents nothing more than the tip of that huge iceberg. 

10.1 Trigonometric Polynomials 

Definition 10.1.1. A function of the form 

n 

p{x) = ak COS kx + Pk sin kx (10-1) 
fe=o 

is called a trigonometric polynomial. The largest value of k such that |afe | +/3a; | ^ 
0 is the degree of the polynomial. Denote by T the set of all trigonometric 
polynomials. 

^Fourier's methods can be seen in most books on partial differential equations, such as [3]. 
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Evidently, all functions in T arc 27r-pcriodic and T is closed under addition 
and multiplication by real numbers. Indeed, it is a real vector space, in the 
sense of linear algebra and the set {sinna; : n e N} U {cosna; : n e w} is a basis 
for T. 

The following theorem can be proved using integration by parts or trigono- 
metric identities. 

Theorem 10.1.1. Ifm,n G Z, then 

sin mx cos nx dx = 0, (10.2) 



/7I 



sin mx sin nx dx 



0, m ^ n 

0, m = 0 or n = 0 

TT m = n^O 



and 



i: 



cos mx cos nxdx = < 



0, m^n 
2-K m = n = 0 
n m = n^O 



If p{x) is as in (10.1), then Theorem 10.1.1 shows 

/TT 
p{x) dx, (3o = 0 
-TT 

and for n > 0, 

/TT pTT 

p{x)cosnxdx, 7Tl3n= / p{x)cosnxdx. 

-IT J — TT 

Combining these, it follows that if 

ttn = — p{x) COS nx dx and 6„ = — / p{x) sin nx dx 



for n e w, then 



P{^) = + 5^ '^n cos nx + bn sin nx. 



n=l 



(10.3) 



(10.4) 



(10.5) 



(Remember that all but a finite number of the a„ and 6„ arc 0!) 

At this point, the logical question is whether this same method can be used 
to represent a more general 27r-periodic hmction. For any function /, integrable 
on [—• 7r,7r], the coefficients can be defined as above; i.e., for new, 

1 r 1 r 

an= — f{x)cosnxdx and bn = — f{x)smnxdx. (10.6) 
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The problem is whether and in what sense an equation such as (10.5) might 
be true. This turns out to be a very deep and difficult question with no short 
answer.^ Because we don't know whether equality in the sense of (10.5) is true, 
the usual practice is to write 

f{x) ^ — + ^y^anCOsnx + bnSinnx, (lO-''') 

indicating that the series on the right is calculated from the function on the left 
using (10.6). The series is called the Fourier series for /. 

There are at least two fundamental questions arising from (10.7): Does the 
Fourier series of / converge to /? Can / be recovered from its Fourier series, 
even if the Fourier series does not converge to /? These are often called the 
convergence and representation questions, respectively. The next few sections 
will give some partial answers. 



10.2 The Riemann Lebesgue Lemma 

We learned early in our study of series that the first and simplest convergence 
test is to check whether the terms go to zero. For Fourier series, this is always 
the case. 

Theorem 10.2.1 (Riemann-Lebesgue Lemma). If f is a function such that 
f exists, then 

lim / f (t) cos atdt = 0 and lim / f (t) sinatdt = 0. 

Proof. Since the two limits have similar proofs, only the first will be proved. 
Let e > 0 and P be the generic partition of [a, b] satisfying 

For rrii = gib {f{x) : x^-i < x < Xj}, define a function g on [a, b] by g{x) = rrii 
when Xi-i < x < Xi and g{b) = rUn- Note that J^^g = 1l{f, P) so 

0 </>-.)<£. 

^Many people, including me, would argue that the study of Fourier series has been the most 
important area of mathematical research over the past two centuries. Huge mathematical 
disciplines, including set theory, measure theory and harmonic analysis trace their lineage 
back to basic questions about Fourier series. Even after centuries of study, research in this 
area continues unabated. 



June 2, 2014 



http;// math.louisville.edu/^lee/ira 



10-4 



CHAPTER 10. FOURIER SERIES 



Since f > g, 

l-b 



f{t) cos at dt 



{f{t) — g{t)) cos at dt + / g{t) cos at dt 



< 



if{t) - g{t)) cos at dt 



g(t) cos at dt 



< / if-9) + 



1 " 

EjTij {(sm(axi) — sin(axi„i)) 



< 



The first term on the right is less than e/2 and the second can be made less 
than e/2 by choosing a large enough. □ 

Corollary 10.2.2. // / is integrable on [— 7r,7r] with an and bn the Fourier 
coefficients of f , then a„ — s- 0 and fe„ — >■ 0. 



10.3 The Dirichlet Kernel 

Suppose / is integrable on [— tt, tt] and 27r-periodic on M, so the Fourier series of 
/ exists. The partial sums of the Fourier series are written as s„(/, x), or more 
simply Sn{x) when there is only one function in sight. To be more precise, 

n 

s„(/, a;) = Y + ^ (flfc COS kx + bk sin kx) . 

k=l 

We begin with the following calculation. 



Sn {x) ^ ~^ + i'^k COS kx + bk sin kx) 
fc=i 

= — / f{t)dt + ''^^— / {f (t) COS kt cos kx + f{t) sin kt sin kx) dt 

= — / /(t) (1 -I- 2(cos fc^ COS fc.T + sinfctsinfcg:)) dt 
^ -^^^ k=i 

= 7r /(t)5I(l + 2cosfc(a;-t)) 
^'^ k=i 

Substitute s = x — t and use the assumption that / is 27r-periodic. 



/ /(x-s) f 1 + 2 Vcosfcs I ds 



(10.8) 
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15/ 






1/2 






/ ^ 






1 X \ ^--"'x \ \ 1 \ j 1 




1 \ / \ / 1 / ""■"■--^ / A 1 








\J -3 







Figure 10.1: The Dirichlet kernel -D„(s) for n = 1,4, 7. 



The sequence of trigonometric polynomials from within the integral, 

n 

Dn{s) = 1 + 2 J2 COS ks, (10.9) 
fe=i 

is called the Dirichlet kernel. Its properties will prove useful for determining 
the pointwise convergence of Fourier series. 

Theorem 10.3.1. The Dirichlet kernel has the following properties. 

(a) Dn{s) is an even 2TT-periodic function for each n G N. 

(b) Dn{0) = 2n + l for each n € N. 

(c) \Dnis)\ < 2n + 1 for each n £N and all s. 
1 



(d) 



2ti 



Dn{s) ds = 1 for each n G N. 



sin(n + 1 /2)s 

(e) Dn{s) = ; for each n E N and s not an integer multiple 



of IT. 



sin s/2 



Proof. Properties (a)-(d) follow from the definition of the kernel. 

The proof of property (e) uses some trigonometric manipulation. Suppose 
n G N and s ^ mi: for any m G Z. 

n 

Dn{s) = 1 + 2^COSfc5 
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Use the facts that the cosine is even and the sine is odd. 

n, ^ n 

COS 



= COS ks H — : — J- sin ks 
smf 

k=—n ^ K=—n 

1 " S S 

= > I sin - cos ks + cos - sin ks ) 

sinf V 2 2 / 

^ k=—n 
1 " 1 

— ^ sin(A; + -)s 



«^^ 2 k- 



This is a telescoping sum. 

sin(n + \)3 
sinf 

According to (10.8), 



□ 



SniJ.x) f{x-t)Dn{t)dt. 

This is similar to a situation we've seen before within the proof of the Wcicrstrass 
approximation theorem, Theorem 9.5.3. The integral given above is a convolu- 
tion integral similar to that used in the proof of Theorem 9.5.3, although the 
Dirichlet kernel isn't a convolution kernel in the sense of Lemma 9.5.4. 



10.4 Dini's Test for Pointwise Convergence 



Theorem 10.4.1 (Dini's Test). Let f : M. ^ M. he a 2TT-periodic function in- 
tegrable on [— 7r,7r] with Fourier series given by (10.7). // there is a 6 > 0 and 
s gM. such that 

'■^ f{x + t) + f{x-t)-2s 



/ 

Jo 



t 



dt < DO, 



then 



^ + ^ (afe cos kx + bk cos kx) = s. 



fe=i 



Proof. Since £)„ is even, 

^n{x) = ^J f{x-t)D^{t) 



dt 



~ 27r J_, 



f{x-t)D^{t)dt + 



1 r 

^ Jo 



f{x-t)D^{t)dt 



{f{x + t) + f{x-t))Dn{t)dt. 
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Figure 10.2: This plot shows the function of Example 10.4.1 and s^ix) for that 
function. 



By Theorem 10.3.1(d) and (e), 

s„(s) - s = / (/(.T + + /(x -t)~ 2s) Ai(t) dt 

1 r/(^ + <) + /(^-0-2.s t 1 

' ^ • sm(n + -jtdt. 



27r 



sm : 



Since t/sin | is bounded on (0,7r), Theorem 10.2.1 shows s„(a;) — s — 0. Now 



use Corollary 8.5.2 to finish the proof. 



□ 



Example 10.4.1. Suppose f{x) — x for — tt < x < tt and is 27r-periodic on M. 
Since / is odd, a„ = 0 for all n. Integration by parts gives fe„ = (— l)"+^2/?i for 
n G N. Therefore, 

-i2 



smnx. 



For a; € (— tt, tt), let 0 < (5 < min{7r — x, tt + x}. (This is just the distance from 
x to closest endpoint of (— 7r,7r).) Using Dini's test, we see 



fix + t) + fix -t)- 2x 



t 



dt 



x + t + x — t — 2x 



dt = 0 < CO, 



so 



E^( — 1)"^"'^— sinnx = a; for — n < x < n. 



(10.10) 



ri=l 
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In particular, when x = tt/2, (10.10) gives another way to derive (9.21). When 
X = IT, the series converges to 0, which is the middle of the "jump" for /. 

This behavior of converging to the middle of a jump discontinuity is typical. 
To see this, denote the one-sided limits of / at a; by 

fix-) = lim/(i) and f{x+) = lim/(t), 

tjx ti,x 

and suppose / has a jump discontinuity at x with 

_ f{x-) + f{x+) 
*~ 2 

Guided by Dini's test, consider 

'■'\f{x + t) + f{x-t)-2s 



J 

Jo 



t 



dt 



f(x + t) + f{x-t)-f{x-)-f{x+) 



< 



f{x + t)-f{x+) 



t 







dt- 






Jo 



dt 



fix -t)- fix-) 



t 



dt 



If both of the integrals on the right are finite, then the integral on the left is 
also finite. This amounts to a proof of the following corollary. 

Corollary 10.4.2. Suppose f ■.M.^'K is 2'jT-periodic and integrable on [— tt, tt]. 
// both one-sided limits exist at x and there is a S > 0 such that both 



fix + t)-fix+) 



dt < oo and 



f 

Jo 



fix -t)- fix-) 



dt < oo, 



then the Fourier series of f converges to 

fix-) + fix+) 



The Dini test given above provides a powerful condition sufficient to ensure 
the pointwise convergence of a Fourier series. There is a plethora of ever more 
abstruse conditions that can be proved in a similar fashion to show pointwise 
convergence. 

The problem is complicated by the fact that there are continuous functions 
with Fourier series divergent at a point and integrable functions with Fourier 
series diverging everywhere [11]. Such examples are much too far into the deep 
water to be presented here. 



10.5 The Fejer Kernel 

Since pointwise convergence of the partial sums seems complicated, why not 
change the rules of the game? Instead of looking at the sequence of partial 
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sums, consider a rolling average instead: 

1 " 

(^n{f,x) = — — Vs„(/,a;). 
n + 1 ^ — ' 

fe=o 

The trigonometric polynomials (T„(/, x) are called the Cesaro means of the par- 
tial sums. If lim„_>.oo cr„(/, a;) exists, then the Fourier series for / is said to be 
(C, 1) summable at x. The idea is that this averaging will "smooth out" the 
partial sums, making them more nicely behaved. It is not hard to show that 
if Sn{f,x) converges at some x, then an{f,x) will converge to the same thing. 
But there are sequences for which (T„(/, x) converges and Sn(/, x) does not. (See 
Exercises ?? and 25.) 

As with Sn{x), we'll simply write cr„(x) instead of (T„(/, x), when it is clear 
which function is being considered. 

We start with a calculation. 



1 " 

<^n{x) = —^^^Skix) 

k=0 

= ^— V— / f(x -t)Dk(t)dt 

=^ r f{x-t)^j2Du{t)dt (*) 

_^ r ^ ^ " sin(fc + l/2)t 

^ —T^ 1 — n 



2tt J ^ n + l'^ sint/2 



A;=0 

^ / fi^-t) , . ,„ EsinV2sin(fc + l/2)tdi 

J -TV (n + 1) sm t 2 f—' 



2tt 



Use the identity 2 sin A sin B = cos(yl - B) - cos{A + B) 

I r . 1 



/7r 1 
f(x — t) ^ ^ — — > (cos kt — cos(fc -I- l)t) dt 
^ (n + 1) sin2 1/2 ^ ^ ' ' 



f{x - t)- — ^^^^ (1 - cos(n + l)t) dt 



The sum telescopes. 

I r 1 

27r J_^'^ ' {n + 1) sin^ t/2 
Use the identity 2 sin^ A = 1 — cos 2A. 

2tt + 1) V smf / 
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Figure 10.3: A plot oi K5{t), Ks{t) and Kio{t). 



The Fejer kernel is the sequence of functions highlighted above; i.e., 



Knit) 



1 /sin^i±i< 



(n + 1) \ sin 



, n e N. 



(10.11) 



Comparing the lines labeled (*) and (**) in the previous calculation, we see 
another form for the Fejer kernel is 



1 " 



(10.12) 



k=0 



Once again, we're confronted with a convolution integral containing a kernel: 

anix) = ^ / fix -t) Knit) dt. 



Theorem 10.5.1. The Fejer kernel has the following properties.^ 

(a) Knit) is an even 2TT-periodic function for each n e N. 

(b) KniO) = 71+1 for each n ^ ui. 

(c) Knit) > 0 for each neN. 



^Compare this theorem with Lemma 9.5.4. 
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(d) ^ J-TT ^n{t) dt = 1 for each n G lj. 



(e) IfO<S< IT, then ^ 0 on [—7r,5] U [6,tt]. 

(f) IfO<S < n, then K„{t) dt ^ 0 and K^it) dt 0. 

Proof. Theorem 10.3.1 and (10.12) imply (a), (b) and (d). Equation (10.11) 
implies (c). 

Let S be as in (c). In light of (a), it suffices to prove (e) for the interval [(5, tt]. 
Noting that sint/2 is decreasing on [S,Tr], it follows that tor 6 < t < w, 



1 /sin^t 

Knit) = — ^ — 

(n + 1) \ sm 



1 

< 



2 

t 

2 

2 



2 



< 



(n + 1) Vsin| 

1 1 
(n + 1) sin^ I 



It follows that Kn ^ 0 on [6, tt] and (e) has been proved. 

Theorem 9.6.1 and (e) imply (f). □ 



10.6 Fejer's Theorem 



Theorem 10.6.1 (Fejer). // / : M — >• M is 2n-periodic, integrable on [— 7r,7r] 
and continuous at x, then cr„(a;) f{x). 



Proof. Since / is 27r-pcriodic and / f{t) dt exists, so does / {f{x — t) — 
f{x)) dt. Theorem 8.2.2 gives an M > 0 so |/(.t - t) - f{x)\ < M for all t. 

Let £ > 0 and choose 6 > 0 such that |/(a;) — f{y)\ < e/3 whenever |a; — 2/| < 
S. By Theorem 10.5.1, there is an AT e N so that whenever n> N, 
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We start calculating. 

Wn{x) - f{x)\ 



1 

2^ 



f{x-t)Kn{t)dt 



2-K 



f{x)Kn{t)dt 



1 

2^ 



1 

2^ 



r {f{x-t)-f{x))Kn{t)dt 

J —IT 

I \f{x-t)-f{x))Kn{t)dt+ [\f{x-t)-f{x))Kn{t)dt 

J-TT J-S 



+ 



f{x))K„it)dt 



< 



1 

2^ 



ifix-t)-f{x))Kn{t)dt 



+ 



1 

2^ 



{f{x-t)- f{x))K^{t)dt 



< 



M 
2^ 



^ r { f{x-t)-f{x))Kr.{t)dt 
J Kn{t) dt+^j^ \f{x -t)- f{x)\Kn{t) dt+^£ Knit) dt 

This shows cr„(a;) f{x). □ 
Theorem 10.6.1 gives a partial solution to the representation problem. 

Corollary 10.6.2. Suppose f and g are continuous and 2tt -periodic on R. If 

f and g have the same Fourier coefficients, then they are equal. 

Proof. By assumption, crn{f,t) = an{g,t) for all n and Theorem 10.6.1 implies 
0 = cr„(/, t) - a„{g, t)^ f - g. 

□ 

In the case of continuous functions, the convergence is uniform, rather than 
pointwise. 



Theorem 10.6.3 (Fejer). // / : 



is 2-K-periodic and continuous, then 



Proof. By Exercise 10.3, / is uniformly continuous. This can be used to show 
the calculation within the proof of Theorem 10.6.1 do not depend on x. The 
details are left as Exercise 10.5. □ 

A perspicacious reader will have noticed the similarity between Theorem 
10.6.3 and the Weierstrass approximation theorem. Theorem 9.5.3. In fact, the 
Weierstrass approximation theorem can be proved from Theorem 10.6.3 using 
power series and Theorem 9.8.2. 
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n 




TT 








2 




2 
















2 








V/-'-' 
























—n 









Figure 10.4: These plots illustrate the functions of Example 10.6.1. On the left are 
shown /(a;), ss{x) and as(x). On the right are shown f{x), (J3{x), (T.-,(,r) and 0-20(2;). 
Compare this with Figure 10.2. 

Example 10.6.1. As in Example 10.4.1, let f{x) — x for — vr < a; < tt and 
extend / to be periodic on K with period 2tt. Figure 10.4 shows the difference 
between the Fejer and classical methods of summation. Notice that the Fejer 
sums remain much more smoothly affixed to the function. 

10.7 Exercises 

10.1. Prove Theorem 10.1.1. 

10.2. Suppose / is integrable on [— 7r,7r]. If / is even, then the Fourier series 
for / has no sine terms. If / is odd, then the Fourier series for / has no cosine 
terms. 

10.3. If / : M — > M is periodic with period p and continuous on [0,p], then / is 
uniformly continuous. 

10.4. The function g{t) = tj sin(</2) is undefined whenever t = 2ri7r for some 
n G Z. Show that it can be redefined on the set {2n7r : n e Z} to be periodic 
and uniformly continuous on M. 

10.5. Prove Theorem 10.6.3. 

10.6. Prove the Weierstrass approximation theorem using Taylor series and 
Theorem 9.8.2. 
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thm:riemannlebesgue, 10-3 
topology, 5-2 

finite complement, 5-14 

relative, 5-14 

right ray, 5-2, 5-14 

standard, 5-2 
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totally disconnected set, 5-5 
trigonometric polynomial, 10-1 

unbounded, 2-6 
uniform continuity, 6-13 
uniform metric, 9-6 

Cauchy sequence, 9-6 

complete, 9-6 
unilateral limit, 6-4 

Weierstrass 

Approximation Theorem, 9-8, 10- 
12 

M-Test, 9-7 



